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Preface 



This book contains revised contributions by the speakers of the 1st IEEE Workshop on 
Wireless-Communication Circuits and Systems, held in Lucerne, Switzerland, from 
June 22-24, 1998. The aim of the workshop was to apply the vast expertise of the 
CAS Society in the area of circuit and system design to the rapidly growing field of 
wireless communications. The workshop combined presentations by invited experts 
from academia and industry with panel and informal discussions. The following topics 
were covered: 

■ RF System Integration (single-chip systems, CMOS RF circuits), 

■ RF Front-End Circuits (CMOS RF oscillators, broadband design techniques), 

■ Wideband Conversion for Software Radio (A/D conversion issues, wideband sub- 
sampling, low-spurious A/D conversion), 

■ Process Technologies for Future RF Systems (Si, SiGe, GaAs, CMOS, packaging 
technologies), 

■ DSP for Wireless Communications (DSP algorithms, fixed-point systems, DSP for 
baseband applications), 

■ Blind Channel Equalization (adaptive interference suppression, design techniques, 
channel estimation). 

The workshop was a great success, with over 130 participants from 19 countries, 
from the U.S. to Europe and Asia, including a large contingent of participants from 
industry (60 %). Feedback from the participants showed that the carefully selected 
combination of tutorial-like lectures with lectures on specialized and advanced topics 
was a feature of the workshop that was particularly appreciated. Due to the relatively 
strong involvement of industry — both in the form of lecturers and listeners — a high 
level of discussion was attained in both panel sessions and informal gatherings. 
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By a stroke of luck, Philips Semiconductors, Zurich, Switzerland, celebrated their 
50th anniversary in 1998. With the well-deserved spirit of celebration and generos- 
ity that this anniversary triggered at Philips, and the goodwill and encouragement of 
the Board of Governors and Excom members of the CAS Society, the finances were 
guaranteed at an early stage. For this the organizers were very grateful. 

The format of this workshop and book has strongly been influenced by the AACD 
workshop series organized by J. H. Huijsing, R. J. van de Plassche and W. Sansen. 
The editors greatly appreciate the inspiration provided by this series. 

It is a pleasure to acknowledge the speakers and authors for making available their 
expertise. Our sincerest thanks go also to D. Arnold, M. Goldenberg, D. Lfm, F. Lus- 
tenberger, H. Mathis, and H. P. Schmid for helping in the preparation of the workshop 
and book. Moreover, D. Lfm, H. Mathis and H. P. Schmid were strongly involved in 
the technical editing of this book, which the editors very gratefully acknowledge. 

Zurich, September 1999 
Markus Helfenstein and George S. Moschytz 
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RF SYSTEM INTEGRATION 

Chris Toumazou 



Dept. Elect. & Electron. Eng. 

Imperial College 
Exhibition Rd. 
London SW7 2BT, U.K. 



1.1 INTRODUCTION TO THE FOLLOWING PAPERS 

The idea for this part of the book arose from a need to convey the intricacies of RF 
system integration in today’s wireless information system arena. The proliferation of 
portable communication devices has created a high demand for small and inexpensive 
transceivers with low power consumption. The Radio-Frequency (RF) and wireless- 
communications market has suddenly expanded to unimaginable dimensions. Sources 
predict that mobile telephony subscriptions will increase to over 350 Million in the 
year 2000. 

While this trend continues and challenges still exist, the RF practice at present is 
one which requires the so-called “green fingers” of design. System integration from 
the interconnection of sub-blocks at the front-end to the interconnection of metal lay- 
ers within the IC is all highly interactive. However, what has now become apparent at 
the low-GHz frequency band is that as feature sizes of silicon devices shrink, there is 
a more natural move towards VLSI at high frequencies where lumped circuit design 
is beginning to find a new home. As we start integrating more of the board-level com- 
ponents onto a single chip, the board layout will then also form a major part of the 
“lumped” circuit. 
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Part I has brought together key engineers from industry and academia to shed light 
on performance demands, board level design, and sub-micron silicon CMOS solutions, 
taking the reader through realistic design scenarios for RF system integration. 

Devices and systems, such as pagers, cellular and cordless phones, cable modems, 
mobile faxes, PDAs (Personal Digital Appliances), wireless LANs, and RF identifica- 
tion tags are rapidly penetrating all aspects of our lives, evolving from luxury items 
to indispensable tools. Semiconductor and system companies, small and large, ana- 
logue and digital, have seen the statistics and are striving to capture their own market 
share by introducing various RF products. Today’s pocket phones contain more than 
one million transistors, with only a small fraction operating in the RF range and the 
rest performing low-frequency “base-band” analogue and digital signal processing. 
However, the RF section is still the design bottleneck of the entire system. 

In contrast to other types of analogue and mixed-signal circuits, RF systems de- 
mand not only a good understanding of integrated circuits, but also of many areas that 
are not directly related to integrated circuits. 




Figure 1.1 RF design disciplines. 



Most of the areas shown in Fig. 1.1 have been studied extensively for more than 
two decades, making it difficult for an IC designer to acquire the necessary knowledge 
in a reasonable amount of time. Traditional wireless system design has thus been car- 
ried out at somewhat disjointed levels of abstraction: communication theorists create 
the modulation scheme and base-band signal processing; RF system experts plan the 
transceiver architecture; IC designers develop each of the building blocks; and manu- 
facturers “glue” the ICs and other external components together. In fact, architectures 
are often planned according to the available off-the-shelf components, and ICs are de- 
signed to serve as many architectures as possible, leading to a great deal of redundancy 
at both system and circuit levels. This results in higher levels of power consumption 
and generally lower performance. 
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Most recently, as the industry moves toward higher integration and lower cost, RF 
and wireless design increasingly demands more “concurrent engineering,” thereby re- 
quiring IC designers from both industry and academia to combine forces and to have a 
sufficient and integrated knowledge of all the disciplines [1]. RF circuits must process 
analogue signals with a wide dynamic range at high frequencies. It is interesting to 
note that the signals must be treated as analogue even if the modulation is digital or 
the amplitude carries no information. 




Figure 1.2 RF design hexagon. 



The trade-offs involved in the design of such circuits can be summarised in the “RF 
design hexagon” shown in Fig. 1.2. While any of the seven parameters trade to some 
extent, all these parameters are severely constrained by the core parameters, namely 
the power consumption and supply voltage. It is important to recognise that, while 
digital circuits directly benefit from advances in IC technologies, RF circuits do not 
benefit as much. This issue is exacerbated by the fact that RF circuits often require 
external components — for example, inductors — that are difficult to bring onto the chip 
even in modem IC processes. RF design techniques are thus becoming highly sensi- 
tive to device physics, and so analog characterisation of digital VLSI technology is 
of primary concern. One of the major challenges is implementing RF circuits on ICs 
instead of PCBs, offering advantages including lower production cost, high functional- 
ity, small physical size, high reliability, and low power requirements. It now becomes 
very necessary to achieve better co-ordination between the “system design” activity 
and the “RF circuit design” activity. 

A few years ago Gallium-Arsenide (GaAs) technology was the primary-choice 
semiconductor for implementing RF ICs due to its low noise figure, higher gain and 
higher output power. Advances in sub-micron silicon CMOS, however, have made 
it possible to achieve higher levels of RF system integration at lower cost than with 
GaAs, predominantly for low-GHz-band wireless applications [2]. The other benefits 
of CMOS RF are the greater manufacturability and minimised power requirements to 
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drive off-chip loads. While integrated silicon BJT transceivers are still more desirable 
for today’s products, CMOS RF solutions are looking very promising, with the real- 
istic prospect of a single-chip transceiver in a plastic package. Furthermore, newer 
device technologies such as Silicon Germanium are maturing rapidly and offer the 
high mobility necessary for today’s RF wireless products. This array of competing 
technologies offers system designers more creative opportunity, and the best wireless 
transceiver solutions may well emerge from system design evolving together with ar- 
chitecture, circuits, antennas, and power allocation plans. In the future, base-band 
signal processing will inevitably make up for imperfections in the front end (e.g. soft- 
ware radio). 

Part I begins with a section by Gordon Aspin from TTP Communications, a com- 
pany with vast experience in RF system integration for cellular products. The section 
describes the realistic design of a part of a fully integrated transceiver IC from Hitachi 
which satisfies multi-band GSM RF specifications. Some of the subtleties of board- 
level integration are presented, coupled with a design approach which attempts to 
make practical GSM handset design a more straightforward task. Emphasis is placed 
upon the importance of understanding total system-level requirements when designing 
a chip, and upon how board level design influences low-level requirements. 

In the next section, Peter Mole from Nortel Semiconductors gives an overview of 
system integration on a chip. Peter takes us through a number ofpractical RF design is- 
sues and then discusses general problems that radio systems must overcome to achieve 
acceptable performance. The section overviews practical concerns for both receiver 
and transmitter and how different radio architectures can be utilised to overcome some 
of the problems. The section concludes with a number of practical design issues for 
integrating radio circuitry in silicon technology. 

The final two sections concentrate on the design of fully integrated transceiver chips 
in sub-micron and deep sub-micron Silicon CMOS technologies. Michiel Steyaert 
from the Katholieke Universiteit of Leuven introduces us to the arena of using deep 
sub-micron CMOS to create single-chip transceiver blocks and components such as 
LNAs, VCOs , up-converters , synthesisers etc. to satisfy cellular performance spec- 
ifications above 1 GHz. The section discusses all the bottlenecks and challenges of 
RF CMOS using plain deep sub-micron devices for integration within systems such as 
DECT, GSM, and DCS 1800. 

Finally, Qiuting Huang et. al. from the Integrated Systems Laboratory at ETH 
Zurich presents a practical high-performance GSM transceiver front-end in a 0.25 pm 
CMOS process. This section concludes Part I by taking the reader through a practical 
RF system integration example. The work demonstrates that excellent RF performance 
is feasible with 0.25 pm CMOS, even in terms of the requirements of the super-hetero- 
dyne architecture. Design for low noise and low power for GSM handsets has been 
given particular attention. 

In conclusion, Part I will give the reader a practical evaluation of state- of-the-art 
RF system design and integration for GHz wireless communications. The chapters in 
Part I encompass the failures, successes, and most of all the realistic RF challenges to 
enable total integration of portable future wireless infomiation systems. 
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2 RF SYSTEM BOARD LEVEL 
INTEGRATION FOR MOBILE PHONES 

Gordon J. Aspin 



TTP Communications Ltd. 
Melbourn Science Park 
Royston, SG8 6EE, U.K. 



Abstract: This paper highlights the importance of system design choices in the 
development of RF chip sets for cellular mobile phones. By way of example, the design 
of the BRIGHT family of RF chip sets for GSM is described. Performance results are 
presented based on these chip sets. 

2.1 INTRODUCTION 

Traditionally, the RF system represents one of the most difficult challenges for compa- 
nies developing digital cellular phones, particularly for companies with limited devel- 
opment experience. It is now possible to buy more or less complete baseband-chip-set 
solutions together with associated software from a number of vendors. However, al- 
though off-the-shelf radio-chip-set solutions are available, they nevertheless require a 
significant level of design expertise on the part of the handset designer in order to re- 
alise a manufacturable product which meets the necessary performance requirements. 
This paper describes the results of some of our work to meet the challenge of realising 
an RF chip set which makes designing a GSM mobile phone a straightforward task. 
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This work, the development of the BRIGHT 1 RF chip set family, has been carried out 
in collaboration with Hitachi Semiconductor of Japan. 

The initial devices were targeted at single-band products — the main GSM market 
at that time. Even more important commercially, however, has been the emergence of 
dual-band 900/1800 capability as a key market requirement. Fortunately, the BRIGHT 
architecture is particularly well suited to multi-band operation, and the latest-genera- 
tion devices, BRIGHT2, are designed to support this mode. 

The objective of the development has been to achieve the maximum level of inte- 
gration in the radio system, compatible with the objectives of lowest total system cost, 
lowest part count, and ease of design and manufacture, within the available IC process 
technology. The approach is a systems approach with the focus upon achieving a de- 
vice design in which the radio requirements of the whole product are most simply and 
elegantly met. 

2.2 DESIGN APPROACH 

What makes RF IC design particularly interesting is that the silicon represents only 
a small part of the total RF system but can make a very large difference to how easy 
or difficult it is to design the rest of the system. It is therefore vital to understand the 
total system requirements when designing the chip. It is also important to be able to 
validate the design at both device and system level. This is achieved by means of a 
System Evaluation Board — essentially a complete handset — on which all the system 
performance parameters can be measured and confirmed (Fig. 2.1). Validation of the 
system is often one of the most expensive parts of the development. 

This radio System Evaluation Board is designed to be compatible with Baseband 
Evaluation Boards used for baseband chip set and protocol software development 
work. The two boards can be plugged together to emulate complete handset opera- 
tion. As well as testing in our own laboratories we have even been able to take such 
a system through “Type Approval” at a GSM test house to verify radio system perfor- 
mance. 

2.3 KEY GSM SYSTEM SPECS 

As with all modem radio standards, there are many specification points for GSM that 
need to be met in order for the radio to conform to the standard. However, a small 
number of these specification points turn out to be critical to the radio architecture and 
design. 

2.3.1 Transmitter phase error 

As a digital phase modulation system, GSM controls how closely the transmitted phase 
follows the ideal modulated phase trajectory. It does this by specifying the phase error 
in terms of an RMS value (5°) and a peak value (20°), across the useful part of the 
burst. The phase error is a very critical parameter for the transmitter, because so many 
things can contribute to it, including digital modulator phase enor, synthesiser settling 
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Figure 2.1 BRIGHT 2 system evaluation board. 



time, switching transients, I/Q gain and phase imbalance, up-converter phase noise 
and spurious modulation, and power amplifier AM-to-PM conversion, etc. 

In putting together a radio system design, budgets are allocated to each of these 
parameters. Some, however, can be difficult to quantify theoretically (e.g. switching 
transients) and may only become apparent once the design is realised in hardware. The 
task of the system designer is to identify a system architecture in which uncontrollable 
and unquantifiable effects are minimised. 

A classic example of a problematic architecture is the direct-up-converter transmit- 
ter, in which a baseband I/Q modulator is mixed up to the final frequency in a single 
stage. As a result, the up-converter’s local oscillator runs also at the final frequency. 
Maintaining sufficient isolation between the modulated power amplifier output and the 
unmodulated low-power local oscillator is very difficult, particularly within a minia- 
ture handset. 

2.3.2 Transmitter modulation spectrum (Fig. 2.2) 

Like the phase error, the modulation spectrum is affected by many factors, including 
digital modulator spectmm, I/Q gain and phase imbalance, up-converter phase noise 
and spurious modulation, power amplifier AM-to-PM conversion, transmitter noise 
floor, etc. A good system design will minimise these effects inherently in the design. 

2.3.3 Transmitter noise in the receiver band (Fig. 2.3) 

To avoid interference between handsets in close proximity, GSM limits the amount of 
spurious radiation emitted from the antenna into the receive band. In the non-extended 
band, this is limited to -79 dBm in a 100kHz bandwidth, or -129 dBm/Hz. Reflected 






12 CIRCUITS AND SYSTEMS FOR WIRELESS COMMUNICATIONS 



10 
o 

-10 
-20 
dB -30 
-40 
-50 
-60 
-70 

-600 -400 -200 0 200 400 600 

Frequency from Carrier (kHz) 
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Figure 2.3 Transmit noise in receive bands. 



back to the input of the power amplifier this typically corresponds to a noise level of 
- 162 dBm/Hz or 12 dB above the ideal thermal noise floor. In practice, modulators 
cannot achieve this level of performance, and typically around 20 dB or more of filter- 
ing is required in a duplexer — which is a relatively bulky and expensive component. 
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Figure 2.4 Receiver sensitivity & blocking requirements. 



In the extended GSM900 band, the specification is relaxed by around 12 dB to 
avoid even more severe duplexer filtering requirements. In the 1800MHz bands, the 
specifications are comparable to the E-GSM requirements scaled by the frequency. 

2.3.4 Receiver blocking vs. sensitivity (Fig. 2.4) 

By themselves, neither the receiver blocking nor the receiver sensitivity requirements 
is intrinsically difficult to meet: it is achieving both requirements simultaneously 
within a low-power, low-cost handset that is the design challenge. The sensitivity 
requirements translate into system noise figures of about 10.2 dB at 900 MHz and 
12.2 dB at 1800MHz for a typical GSM equaliser realisation. It is good practice to 
design for a nominal perfomiance at least 4 dB better than these figures to give ade- 
quate production margin. The most challenging blocking requirement is the 3 MHz 
blocking figure which is almost 80 dB up on the wanted signal (74 dB for GSM 1800). 
Translated into voltages on chip, it is necessary to receive a signal at a level of around 
2 pV, without distortion, in the presence of a blocking signal of around 20 mV. This 
degree of linearity must be maintained by the receiver front end, i.e. the LNA, and 
particularly the mixer, before the blocking signal amplitude can be reduced in the first 
IF filter. 

2.4 ARCHITECTURE CHOICES 

The requirements highlighted in the previous section are reflected into various architec- 
tural choices in the design of the BRIGHT devices. On the transmitter, the key choice 
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Figure 2.5 Offset phase locked loop transmitter. 



is the use of an offset phase-locked loop. This approach has substantial benefits for 
systems like GSM which employ constant envelope modulation. 

As a result of the strict phase and modulation accuracy requirements of GSM, it is 
only really feasible to perform transmit carrier modulation in the digital domain. This 
must then be converted into an analogue signal and translated to the required transmit 
frequency. In a conventional approach, the frequency translation is performed in one 
or more up-conversion stages. One of the problems of trying to do this operation in a 
single stage, when the mixer’s local oscillator runs at the same frequency as the trans- 
mitted signal, is that of cross-coupling between the modulated high-level transmitter 
power output and the unmodulated low-level VCO. A solution to this problem is to use 
multiple conversions, but each conversion stage introduces spurious conversion prod- 
ucts which must be filtered out. A better approach is the one illustrated in Fig. 2.5. In 
this scheme, the required signal modulation is impressed upon a VCO by means of a 
phase locked loop. The modulated VCO output is mixed down to a suitable interme- 
diate frequency and compared with the signal from the digital modulator, which has 
been up-converted to the same IF. The loop comparison frequency is not critical and is 
chosen to provide a suitable frequency plan. Channelisation is supported by stepping 
the local oscillator as normal. 

The bandwidth of the loop has a significant effect upon overall transmitter perfor- 
mance. The low-pass filter in the offset PLL must be chosen to optimise the following 
parameters: 

Minimum loop settling time (following frequency steps). 

Gaod phase tracking (to give minimum phase error). 

Minimum in-band spurii (to meet modulation spectmm requirements). 

Suppression of wide-band modulator noise (to meet transmit noise in receive band 
requirements). 

Typically, a loop bandwidth of around 1 MHz is found to be optimum. Using a basic 
offset PLL design of the form shown has clear benefits: 



Tbe VCO is inherently a constant-envelope device, hence no spurious amplitude 
modulation occurs in the signal driving the PA, and therefore we find no spurious 
PM generated in the PA through AM-PM conversion effects. 
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■ The noise floor of the VCO is sufficiently low so that it can meet the receiver-band 
noise -floor requirements without further filtering. The duplex filter can be replaced 
by a simple low-pass harmonic filter and a transmit/receive switch, with consider- 
able cost and space savings. 

■ Removing the duplex filter reduces the loss in the transmit path by around 1 dB. 
This implies up to 25% longer talk time for the handset. 

■ Removing the duplex filter removes a major cause of ripple in the transmit band. 
This allows the handset manufacturer to make use of the margin in the transmit 
level specifications to operate the handset closer to the minimum level, thus giving 
further battery life improvements. 

In the receiver, the architectural choice is mainly about the number of conversion 
stages and the frequency plan. A two-IF approach has been chosen for BRIGHT, 
with a 225 MHz SAW filter defining the first IF. The use of a relatively high first 
IF means that image frequencies and other spurious responses from the first mixer 
cause no particular problems. The majority of receiver gain occurs at the second IF 
of 45 MHz, and it is here also that the AGC is applied. This approach minimises 
power consumption in the receiver. In our reference baseband solution (the Analog 
Devices GSM baseband chip sets), the channel filtering occurs digitally as part of the 
analogue-to-digital converter — another example of system design choices. Thus, 
at 45 MHz, only a relatively wide-band LC filter is required to provide protection 
against blocking of subsequent stages. The architecture of the complete 900 MHz 
BRIGHT device is shown in Fig. 2.6. The main UHF local oscillator runs at 1150 to 
1185 MHz and drives both the receiver first mixer and the transmitter offset mixer. By 
ensuring that the receiver first IF and the transmitter offset loop comparison frequency 
are spaced by the Tx/Rx duplex offset (i.e. 45 MHz) the pulling range of the first local 
oscillator is minimised to the operating bandwidth of 35 MHz. 

The loop comparison frequency is chosen to be a multiple of 45 MHz (in this case 
6 x 45 MHz = 270MHz), to give a simple scheme for generating all other required 
frequencies by on-chip division as shown. 

In terms of integration, all the transmit and receive silicon functionality is integrated 
onto a single device, but the PLL synthesiser functions for the two main oscillators are 
on a separate device. This choice was based on an assessment of the risk factors in- 
volved versus the benefits. There is virtually no difference in printed circuit board area 
whether or not the synthesisers are integrated, yet there is a significant risk of noise in 
the digital PLL synthesiser leaking into the analogue parts of the chip. Furthermore, 
the PLL synthesisers are established parts. 

The front-end low noise amplifier is also not fully integrated within the BRIGHT 
device. Initial studies identified that it would be very difficult to achieve the necessary 
noise figure and blocking performance from an integrated amplifier. Instead it was 
decided to integrate the bias function for the LNA, and to use an external transistor 
as the amplifying device. The final die for the BRIGHT 900 MHz part is shown in 
Fig. 2.7. Some key process and packaging parameters are summarised in Tab. 2.1. 

When the first BRIGHT devices were designed, the main market interest was in 
GSM900. However, support of other single-band standards, DCS 1800 and PCS 1900, 
was also of interest and was a factor in the choice of architecture. As a result, a high 
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Figure 2.7 BRIGHT chip die, 



frequency variant BRIGHT-HF was developed with improved high frequency mixer 
performance. Only towards the end of 1997 did it become apparent that dual-band 
would become a major market requirement, and with that came the further develop- 
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Table 2.1 BRIGHT process parameters. 



Parameter 


Value 


Process Technology 


0.6pm BiCMOS 


Ft 


15 GHz 


r bb' 


150ft 


Supply voltage 


2.7 V 


Package 


LQFP-48 



ment of the BRIGHT2 device. The BRIGHT architecture is well suited to dual-band 
operation, with only small adaptations. On the receive side, the only change is the 
inclusion of a second mixer to support the second band: the first IF and subsequent 
circuitry is shared between both bands. On the transmit side, the only change is the 
inclusion of a different divide ratio for operation at 1800 MHz, giving an offset loop 
comparison frequency of 135 MHz. For minimum VCO range, the difference between 
transmit and receive IFs should equal the Tx/Rx duplex offset. In this case, for the 
1800MHz operation, we have a 90 MHz IF separation, compared to a 75 MHz du- 
plex offset — thus giving around 20% excess pulling requirement over the theoretical 
minimum. 

All other areas of the architecture are essentially unchanged. For market reasons, 
however, a digital AGC is included in BRIGHT2, compared to the analogue AGC 
scheme used in the original BRIGHT. It is found that the digital AGC gives better 
performance than the analogue AGC, because the on-chip DAC can be better matched 
to the characteristics of the AGC amplifier. 

2.5 RESULTS 

2.5.1 Transmitter phase error 

Fig. 2.9 plots the measured peak phase error as a function of channel number in the 
GSM band. The RMS phase error is very consistent at around 3°, peaking up to just 
under 4° in one place. The truly remarkable factor is the independence of the phase 
error with transmit power level. This is testimony to the real benefits of the offset 
phase locked loop approach, where some of the sources of phase error, which cause 
major problems with other architectures, are completely removed by design. Similar 
results have also been reported to us for single -band GSM900 and DCS 1800 phones in 
production, based on the single-band BRIGHT variants. The peak phase error is also 
well within the specification of 20°. Comparable results are obtained at 1800MHz 
(see Fig. 2.10). 
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Figure 2.8 BRIGHT 2 architecture. 
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Figure 2.9 GSM900 phase error. 



2.5.2 Transmitter modulation spectrum 

Fig. 2.11 illustrates the typical modulation spectra achieved with the BRIGHT2 design 
at 900 MHz. The spectrum falls comfortably within the specification mask. Similar 
results are obtained at 1 800MHz. 
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Figure 2.10 GSM1 800 phase error. 



2.5.3 Receive sensitivity 

Fig. 2.12 shows the measured receive sensitivity in both bands plotted as a function 
of channel number. For GSM900, the specification is -102 dBm, for GSM1800, the 
specification is-lOOdBm. A minimum margin of4dB is achieved on these figures, 
with 5-7 dB typical margin. This provides sufficient margin for production and tem- 
perature tolerances. 

The underlying sensitivity variation displayed in the graphs is due to ripple in 
the front-end SAW filter. There are two well-known “deaf’ channels in GSM900 
receivers, channels 5 and 70, at the 72 nd and 73 rd harmonic of the GSM reference 
frequency of 13 MHz. These are clearly visible in the figure. The other deaf channel 
is thought to be due to a local GSM base station transmitting on this channel in the 
vicinity of the laboratory. 
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Figure 2.1 1 Transmitter modulation spectrum. 
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Figure 2.12 Receive sensitivity. 

2. 5. 4 Blocking performance 

Fig. 2.13 shows the blocking performance for BRIGHT2 in the 900 and 1800MHz 
bands. The specification points are 2% BER and -23 dBm for GSM900, -26 dBm 
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for GSM 1800. Around 2-3 dB margin is thus achieved in each band. The blocking 
performance of BRIGHT2 in the GSM900 band is further illustrated in Fig. 2.14. A 
wanted signal is applied, 3 dB above the reference sensitivity level, and then the level 
of blocking signal which causes the receiver BER to degrade to the nominal 2% level, 
is measured. The wanted signal is on channel 61, i.e. at 947.2MHz. 

BER vs 3 MHz Blocking Level 




— o— -3MHz (GSM) 
— +3MHz (GSM) 
-o- -3MHz (DCS) 
— • — +3MHz (DCS) 



Figure 2.1 3 Receiver blocking performance. 



The GSM requirements are as follows: 

■ -43 dBm at ±0.6 MHz, i.e. < 946.6 MHz & > 947.8 MHz 

■ -33 dBm at ±1.6 MHz, i.e. < 945.6 MHz & > 948.8 MHz 

■ -23 dBm at ±3 MHz, i.e. < 944.2 MHz & > 950.2 MHz 

The most difficult requirement is the 3 MHz blocker, and for this, a minimum of 
15 dB margin is demonstrated. Since these measurements were made, it was iden- 
tified that the first mixer was not particularly well-matched (hence the ripple), and 
subsequently better performance with more than 3 dB margin has been demonstrated. 

2.6 FUTURE OPTIONS 

Looking to the future, there are a number of possible options. Firstly, the levels of inte- 
gration can be increased, for example by including on chip the VCOs or the baseband 
interface. The challenge with putting VCOs on chip is to realise circuits of sufficiently 
high Q to meet the phase noise requirements; also low-loss varactors are not normally 
realisable in a BiCMOS IC process. Nevertheless, there is much interest in this area. 

Including the baseband interface within the RF chip is more straightforward in 
principle. Whether this makes commercial sense is more questionable — integration 
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Figure 2.14 Receiver swept blocking measurements. 



does not help reduce the total number of pins in the package very much, and the area 
for a 64-pin QFP is actually greater than for a 48-pin QFP plus a 20-pin SSOP. Also, 
because the baseband interface may be realised in smaller geometries than the RF 
device, the total power consumption may be increased by integration. 

A more fruitful line is perhaps to consider greater functionality. In this regard, 
triple-band phones may be a market requirement soon, a requirement which may be 
supported as a further extension of the BRIGHT architecture. 

In conclusion, it is clear that radio design remains one of the most challenging and 
exciting areas in this industry. Whatever happens in wireless communications over the 
next decade, elegant design at both circuit and system level will continue to be a major 
factor in the success of products, as it has been to date with BRIGHT. 
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3.1 RF ISSUES 

Before considering the details of implementation of an RF system on a chip, it is worth 
spending some time discussing the problems that any radio system must overcome if to 
achieve an acceptable performance. The important aspect to remember is that the radio 
system never operates in isolation. It is not sufficient to think only of the transmitter, 
the link and the receiver, though many problems lie in this simple chain alone. It is also 
important to realise that the transmitter can interfere with other links, and a receiver 
may be unduly sensitive to unwanted, but entirely legitimate, signals. 

3. 1. 1 Receiver concerns 

In any system, a band is defined in which the receiver may receive signals. The re- 
ceiver must be able to reject signals outside the band without loss of performance. 
These signals may be very large (e.g. TV transmissions) and quite capable of over- 
loading a sensitive receiver. If the receiver is overloaded, amplifier compression will 
reduce the receiver gain and hence the ability to detect weak signals, or the non-lin- 
earities in the receiver that are excited by the overload will allow unwanted signals 
to intermodulate. This may result in a distortion product falling onto the wanted sig- 
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nal and effectively masking it. It is therefore imperative that these signals are heavily 
filtered before the input to the receiver. 

The receiver must respond to any signal in the system band, however, it must also 
reject unwanted in-band signals without suffering from overload. Such unwanted sig- 
nals may be significantly larger than the wanted; consider the situation when you are 
trying to make a cellular phone link to your service provider’s remote base-station but, 
unfortunately, you are physically adjacent to a second service provider’s base-station. 
The system specification provides guidelines to the levels of signals which can exist. 
Fig. 3.1 shows the relative levels for signals in the DECT cordless phone system. In 
this system, where communication channels are allocated to both time and frequency 
slots, the frequency channels are separated by 1.728 MHz. A weak signal, (defined in 
the DECT specification as -73 dBm) must be receivable in the presence of a signal 
13 dB stronger in the neighbouring channel and 34 dB stronger in the next neighbour- 
ing channel. To achieve this rejection of neighbouring interferes, a tunable filter is 
required. In practice this is achieved by mixing the signal to a fixed intermediate fre- 
quency and then filtering. The specifications reflect the fact that it is more difficult 
to filter the neighbouring channel than more distant ones. Until the signals can be 
filtered, however, the radio must be designed so that the interferes do not create over- 
load. Unlike out-of-band blocking signals, in-band interferers cannot be filtered prior 
to the sensitive input of the receiver. 



Signal 

Power 




Figure 3.1 Adjacent channel interferers for DECT system. 



The frequency translation that is required in a receiver to move the radio frequency 
input to the required intermediate frequency for filtering is achieved with a mixer. A 
mixer effectively multiplies the RF input signal with a chosen local oscillator (LO) 
signal. Thus we can ideally represent the RF input by sin(a> r f) and the local oscillator 
by cos (coLt). Thus the output can be written as: 

sin&vf -cos&>£/ = 0.5(sin(ay + &>£)/ + sin (av — coi)t) . 

From this we can see immediately that an unwanted output is produced at the sum 
frequency, but this is easily filtered because of the frequency difference. A more subtle 
defect is that two input frequencies (a >l ± &>if) can produce signals at the intermediate 
frequency (coif). Thus there are two frequencies we are sensitive to, the wanted and the 
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image frequency. It is therefore essential that any signal present at the image frequency 
is filtered out in front of the mixer. 

In practice, to achieve efficient mixing, it is normal that the LO signal is close to 
a square wave. Thus the LO signal will contain the third harmonic of the LO (about 
9 dB lower in signal amplitude) which is also multiplied by the input signal. This 
immediately means that signals at (3a) l ± cdif) will also appear at the output as an 
intermediate-frequency signal. Other harmonics will behave similarly. Again input 
signals at these frequencies must be removed prior to mixing. 

Another defect of the LO signal is that it suffers from phase noise (or jitter). This 
means the LO is not a pure tone but a spectrum of signals centred on the desired 
frequency. Thus the mixer will respond to signals close to the wanted signal with a 
sensitivity falling off with the phase-noise spectrum. This problem is known as recip- 
rocal mixing. Thus, to ensure that the receiver can reject neighbouring channels, it is 
important to provide an LO signal with low phase noise. These issues are illustrated 
in Fig. 3.2. 
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Figure 3.2 Non-idealities in a mixer. 



3.1.2 Transmitter concerns 

The transmitter must efficiently produce enough output power to ensure that the sys- 
tem has sufficient range whilst ensuring that power is not transmitted into neighbour- 
ing channels. It is important to understand how power can be transmitted at unwanted 
frequencies. Firstly, harmonics will be generated in an efficient power amplifier. These 
can usually be filtered out before they reach the antenna, but it must be remembered 
that any filter will also attenuate the wanted output, hence reducing the overall effi- 
ciency and increasing the current consumption from the battery. Non-linearity in the 
power amplifier will spread the spectrum of the signal into neighbouring bands. This 
behaviour is known as spectral regrowth. It is worse with modulation schemes that are 
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not constant amplitude. Intermodulation is usually at its worst in the power amplifier 
itself, but intermodulation throughout the transmitter must be considered. 

Broadband noise in the transmitter either due to noise in analogue circuit elements 
or due to quantisation noise if the signal is generated from a D-to-A converter must be 
considered. The latter, in particular, will require careful attention to band filtering to 
ensure low spurious emissions. A low-phase-noise local oscillator is also required to 
keep the noise emissions to acceptable levels. 

In a mechanism similar to phase noise, spurious signals present on the local oscil- 
lator will also give rise to spurious emissions on the output signal. Spurious signals 
can arise from the frequency synthesis process or from unintentional couplings to the 
VCO. On single -chip systems, the identification of these couplings can be particularly 
difficult, but attention must be paid to single-ended CMOS input signals which couple 
strongly to the substrate and also to high-power outputs. An example of the former is 
given in [1]. These effects are illustrated in Fig. 3.3. 




Figure 3.3 Non-idealities in a transmitter. 



3.2 RADIO ARCHITECTURES 

With the issues discussed in the previous section, the advantages of different architec- 
tures can be discussed both in terms of performance and suitability for integration. 

3.2.1 Receiver architectures 

The double-superheterodyne architecture (Fig. 3.4). This is probably the most 
commonly employed architecture in current wireless systems. The out-of-band block- 
ing signals are reduced by an RF bandpass filter placed immediately after the antenna. 
The signal is then amplified by an LNA, which must have a sufficiently low noise to al- 
low detection of weak signals but must also have the dynamic range to handle in-band 
interferers. The bandpass filter is usually insufficient to reduce signals at the image 
frequency to the system noise level, and so a second image filter is inserted prior to 
mixing. To ensure that the image is sufficiently far away from the wanted signal to 
allow effective filtering, a relatively high first intermediate frequency must be chosen 
(for 1 to 2 GFIz RF systems an IF of 100-200 MHz is common). 




INTEGRATION OF RF SYSTEMS ON A CHIP 



27 




Lot Quadrature L02 

Figure 3.4 The double-superheterodyne architecture. 



The mixer must still handle the complete dynamic range of the in-band signal. 
After the mixer, a SAW filter can be used to achieve the channel filtering. At these 
frequencies the SAW filter is small but usually has a large in-band loss when complete 
channel filtering is to be achieved. The output drive of the mixer must therefore boost 
the signal level to allow for this loss. 

Once the interfering channels have been attenuated, the signal can be boosted to 
a high level (it can be limited if a constant-amplitude modulation scheme is used). 
The signal is then reduced to baseband frequency for demodulation. It is of course 
possible to split the channel filtering between the two intermediate frequencies. This 
will require a greater dynamic range in the second mixer. 

This architecture requires the synthesis of two local oscillators, and their frequen- 
cies must be chosen so that spurious responses from the radio are kept to a minimum. 
This aspect of frequency planning, which will not be discussed in more detail here, is 
a well-understood design process which requires considerable care and experience. 

This design requires several external filters and therefore does not lend itself to easy 
integration as the pin count increases. Moreover, the filters are usually single ended — 
though this is not essential — and hence achieving isolation between pins becomes an 
issue. In particular, the channel filter will often need to provide 50 dB of attenuation 
at key frequencies, thus implying that greater isolation must be achieved between the 
pins and with respect to signal ground if the filter response is not to be degraded. The 
image filter can be eliminated if an image-rejecting mixer is used. This will prevent 
the need to come off chip after the LNA and makes an LNA plus image-reject mixer a 
useful integrated building block. 

The direct-conversion architecture (Fig. 3.5). The direct-conversion receiver, be- 
cause of its simplicity, appears to offer the best opportunity for integrated systems. 
Some examples of its use in wireless systems do exist today, but it is not as simple in 
practice. 

Once again, an RF bandpass filter is placed at the input. The LNA's output is 
passed into the mixer. The LNA must handle the same dynamic range as for the 
superheterodyne architecture and it must have enough gain to lift weak signals above 
the noise of the mixer. The mixer however, now converts directly to baseband. Thus 
the signal is its own image, and channel filtering can now be carried out by low-pass 
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Figure 3.5 The direct-conversion architecture. 



baseband filters. Only one local-oscillator frequency needs to be synthesised, and 
frequency planning is straightforward. 

To achieve a precise conversion to baseband, it is important that good quadrature 
can be achieved at the local-oscillator frequency. Unlike the superheterodyne architec- 
ture, this quadrature now needs to be achieved at the RF frequency, which poses some 
problems that are generally soluble. 

The main issue is associated with DC signals which are generated by defects in the 
mixer. These signals are implicitly in-band, it is therefore difficult, and sometimes im- 
possible, to filter them from the wanted signal. It is necessary to keep them sufficiently 
below the signal. Unfortunately, the amplification that can be applied to the signal be- 
fore mixing is limited by the level of in-band interferers which must not overload the 
mixer. 

The DC offsets are caused by a collection of effects. Imbalance in the mixers will 
lead to a DC output. This is generally a constant quantity and could be cancelled with 
suitable circuitry. Any leakage of the local oscillator to the input of the receiver will 
result in a DC signal being generated. If this leakage is via radiation coupling into the 
antenna, then this may vary with the local environment. Finally, non-linearities in the 
mixer may cause signals to be generated at DC from other interferers. The latter mech- 
anisms can be time varying, and any offset cancellation needs to be able to respond 
to time variation. It is only when special precautions are taken to cancel DC offsets 
that direct-conversion architectures can be used. More details of the direct-conversion 
issues are given in [2]. 

Digitisation at IF (Fig. 3.6). This architecture is still a research activity. After the 
first mixer the full dynamic range of the in-band signals may still exist. This full range 
needs to be captured by the A-to-D converter. A bandpass sigma-delta converter offers 
potential for a very flexible system as it does not place high demands on the anti-alias 
filter. With such a system, the final channel filtering can be made programmable in the 
DSP filter to allow a single radio architecture to adapt itself to a range of systems. At 
present the complexity and power consumption in the A-to-D converters provide a sig- 
nificant challenge. As an alternative to bandpass sigma-delta converters, sub-sampled 
or pipeline converters may provide a viable alternative, but the dynamic range which 
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can be accommodated will be limited by the number of available bits and the linearity 
of the converter. It should be noted that a first IF of about 100 MHz will be required 
to enable adequate image rejection to be obtained at the radio frequency. 
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Figure 3.6 Architecture for digitalisation at IF. 



If a very-wide-bandwidth A-to-D converter can be used, this system also offers the 
potential to capture all the activity in the RF band from one receiver. There would be 
no need to tune to individual channels; this could be done by tuning the digital filter. 
This would enable radio base-stations to operate on many channels with one radio 
unit. The number of simultaneous channels that can be received would be limited by 
the number of DSP filters present. 

3.2.2 Transmitter architectures 

Direct up-conversion (Fig. 3.7). Direct up-conversion offers a very flexible means 
of implementing a general modulation. The baseband I and Q signals contain the 
phase and amplitude modulation information and can be generated digitally. They are 
converted to analogue signals using D-to-A converters. The major imperfections of 
the D-to-A converters are quantisation noise and replications of the wanted signal at 
multiples of the D-to-A clocking frequency. To prevent spurious emissions at unac- 
ceptable levels, the signal must be band filtered. The filtering is most easily performed 
while the signal is at baseband. 




Figure 3.7 Direct up-conversion architecture. 



Whilst the signals are at baseband, any non-linearities will create distortion prod- 
ucts which will fall in neighbouring channels. It is therefore important that these stages 
are highly linear as it will be impossible to filter such products once they have been 
converted to RF. 
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To convert the signal to RF, a quadrature up-mix is required. The quality of the 
quadrature will determine the accuracy of the modulation. Any DC offsets will appear 
as a leakage of the local oscillator through to the output. Once there it will appeal - as 
a modulation error. These up-mixer defects must be kept small (about - 40 dBc) in 
order to obtain a high-integrity communication link. Integration and highly balanced 
circuitry offer this possibility. 

The VCO will be very sensitive to pulling from the output signal as it is operating 
in the centre of the output band. Whether this architecture is implemented in an inte- 
grated or a discrete form, unwanted coupling which causes pulling of the VCO will 
require careful consideration. In integrated form, however, evaluation of the interac- 
tion between on-chip components is poorly developed. The design uncertainties in 
this architecture need to be balanced carefully against its undoubted simplicity when 
considering integration. 

Offset up-conversion (Fig. 3.8). In this architecture, the first up-mix is the quadra- 
ture mix and it is to a low first IF. The lower local-oscillator frequency means that a 
good quadrature local oscillator is more easily built and local oscillator feed-through is 
less of an issue. As for the direct-conversion architecture, quantisation noise and repli- 
cations of the signal must be filtered out, though with this architecture the harmonic 
filter can share this task. The harmonic filter also has the job of removing all the un- 
wanted mixer products from the first mix to ensure they do not appear as unwanted 
emissions at the output. 




Reconstruction 
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Figure 3.8 Offset up-conversion architecture. 



The second mixer needs to be a good single-sideband mixer, as the unwanted side- 
band will be offset from the wanted RF signal by twice the first intermediate frequency 
and will cause a significant interference problem if not well suppressed. The final 
mixer can use a quadrature structure to suppress the unwanted sideband, but even with 
balanced on-chip components some degree of filtering will be required to keep the 
unwanted image emission low. To achieve this, the unwanted sideband must lie signif- 
icantly outside the system band, and this will determine the lowest practical limit for 
the first intermediate frequency. 

The selection of the first and second intermediate frequencies must ensure that un- 
wanted mixer products never result in unacceptably high emissions; this problem is 
very similar to that of ensuring lack of spurious responses in the double-superhetero- 
dyne receiver. It does however enable an architecture to be designed where the VCOs 
are operating well away from the power output frequency and are not susceptible to 
pulling. The need for external filtering will make integration of this type of architec- 




INTEGRATION OF RF SYSTEMS ON A CHIP 



31 



ture more difficult, but if an external PA is used anyway, this constraint is not severe. 
The double up-mixer with external harmonic filter is a very useful integrated block. 

Loop-locked and modulated VCO (Fig. 3.9). In this architecture the concept is to 
reduce the need for signal-path filtering by generating a pure modulated signal from 
a VCO. With this concept, since the modulation is effectively applied to the VCO in- 
put, amplitude modulation of the signal is not possible. However for phase/frequency 
modulation schemes this is an attractive option. 
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Figure 3.9 Loop-locked and modulated VCO architecture. 



To ensure that the modulation is precisely controlled about a desired centre fre- 
quency, the modulation is applied in a feedback loop. The aim of the loop is to ensure 
that phase modulation sampled at the VCO output is in exact anti-phase with the phase 
at the I and Q baseband inputs. This will then result in a constant-phase signal at the 
phase detector. Using the high-gain phase detector output to drive the VCO, the result- 
ing loop ensures that this condition is closely maintained. The spurii that can enter the 
VCO are limited by the bandwidth of the lowpass loop filter, enabling simple control 
of spurious signals. Because the signal at the PA is very pure, this simplifies the fil- 
tering after the PA too. Here, only PA generated harmonics need be removed, which 
requires a lower-loss output filter and hence improved power efficiency. 

The absolute frequency is determined by the reference oscillator and the frequency 
offset/division block. These need to be carefully planned. Offsetting the frequency 
requires down-mixing, and it is important to filter unwanted products, as for the su- 
perheterodyne receiver. The frequency planning will however ensure that the VCOs 
should not be susceptible to pulling, and the main VCO is not susceptible as it is phase 
coherent with the PA. It is not clear, however, whether for a given frequency require- 
ment, this architecture can produce a design with low-frequency external filter which 
is easily integrable. 

3.2.3 Architectures overview 

The architectures under consideration are still very similar to those used in radios 
built from discrete components. SAW filters still play a significant role as they pro- 
vide good filter characteristics with a very wide dynamic range whilst consuming no 
power themselves (though we should not overlook the power required to compensate 
for their loss). Architectures which depend on many SAW filters place limits on the 
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integrability due to the need to get signals on and off chip, whilst maintaining good 
signal and ground isolation. 

SAWs are also fixed-performance devices, thus if the radio is required to operate 
across systems, several filter devices which can be switched into the path will be re- 
quired. Architectures which will allow the programming of the filters will be very 
attractive in the future for such multi-mode systems. Architectures where the signal is 
digitised with a wide dynamic range and is filtered and demodulated using the power 
and low cost of data processing implemented in a small-dimension CMOS process 
look attractive for this purpose. However, sufficiently high performance and suffi- 
ciently low power consumption has not been achieved in the analogue to digital con- 
version. 

Implementation in integrated form offers significant advantages for matched and 
balanced circuitry. Thus architectures which exploit this, for example image-rejecting 
mixer based circuits, will evolve as prime candidates for integrated radios. However, 
in any integrated radio, we need to be able to guarantee isolation between the sections. 
Currently, isolation is achieved by PCB and screened-compartment design. The move 
to integration will require that isolation is achieved on chip. At present, design skills 
in this area are low. The choice of architecture to minimise isolation requirements, 
and the development of techniques to calculate the isolation across the chip are going 
to be key factors, if the level of integration of radios is to increase. 



3.3 SOME DESIGN ISSUES 

In this section, some of the design issues for integrating radio circuitry in silicon tech- 
nology are introduced. 



3.3. 1 Power supply and ground coupling 

The essential problem here is that RF currents flowing in the power-supply and ground 
leads can drop significant potentials across track and bond inductances. Moreover, ca- 
pacitances can resonate with the bond/track inductances causing the impedance seen 
by the circuitry to become high at certain frequencies. These frequencies must be sep- 
arated from key circuit frequencies. Supply return currents may cause potentials to be 
dropped on signal grounds. This can give rise to feedback to the input, poor isolation 
and even instability. Fig. 3.10 shows the situation for power supply decoupling. 
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Figure 3.10 Power supply decoupling. 
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In this circuit, some on-chip capacitance has been added to decouple the power 
supply. If we assume the power supply is perfectly decoupled close to the chip, 
then the bond inductance — modified by any mutual coupling between the supply and 
ground lines — will resonate with the internal decoupling capacitance. If the on-chip 
circuitry takes a significant current at this resonant frequency (represented by the cur- 
rent source) then a large power supply ripple will occur. This ripple is limited by the 
loss in the circuitry itself and the loss in the decoupling capacitors. Simulation can 
be used to estimate the magnitude of this ripple and also to decide on the values of 
the capacitance to use (to put the resonance in an innocuous range) and how much 
damping to provide if the circuit does not provide sufficient loss. 

As general design principles: 

■ Separate power supplies for blocks which require isolation. 

■ Keep ground and power supply pins close to exploit mutual inductance. 

■ Add on-chip decoupling, choose resonance away from circuit frequencies. 

■ Damp resonance if circuit does not provide sufficient loss. 

■ Try and separate signal grounds and ground power returns if possible. 

3.3.2 Substrate coupling 

As a general rule, as interconnection and circuitry scales down, cross-talk capacitance 
decreases. This occurs because the length of interconnect required decreases as the 
circuit compacts with scaling, whilst the cross-talk capacitance per unit length varies 
relatively little. This implies that to make higher speed circuitry, all dimensions will 
need to be reduced, and integration is a natural route forward. 

However, with integration, there is also the possibility that signals will couple 
through the substrate. With silicon technology, where the substrate is conducting, 
it is possible for the signal to be conducted through the substrate. Fig. 3.11 illustrates 
this. 




Figure 3.1 1 Coupling through the substrate. 



Devices sitting in the top few microns of the silicon can inject charge capacitively 
into the substrate. Even on a silicon-on-insulator (SOI) technology this will happen, 
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but a good SOI technology will have a low coupling capacitance. The major coupling 
will be to nearby devices or local contacts. Some of the charge will be injected more 
deeply and couple to distant circuit elements. This will be a source of interference. 
Finally, some charge will return via the rear die plate. The die plate potential will vary 
in response to the charge and the return path impedance (which will include capacitive 
coupling to other parts of the circuit and the impedance to ground of the die plate). 
If the die plate does not have a low ground impedance then it can couple the injected 
signal effectively across the whole chip. 

These effects are being studied now. Means of simulation are being developed (for 
example, see [3]). As a general rule, balanced signals will always help injection, and 
local grounded substrate contacts are beneficial. 

3.3.3 On-chlp oscillators 

Oscillators on chip always need careful consideration. Although the maintaining am- 
plifier can be easily accommodated on chip, there is an advantage to keeping the whole 
oscillator on chip. This however requires an on-chip resonator. The phase-noise per- 
formance is directly dependent on the resonator’s quality factor. Spiral inductors can 
be manufactured on chip, but their quality factor is limited by the resistance of the 
metal used to wind the inductor (losses in the substrate are usually a smaller effect). 
Measurements of the phase noise achieved using a 2 pm-thick Al/Si/Cu metallisation 
show that whilst the phase noise required for DECT is attainable, the requirements of 
a system with much narrower bands such as GSM is rather too demanding. 

The second issue to be considered with on-chip oscillators is pulling. The oscil- 
lator can be considered as a highly shaped and amplified noise source. Essentially, 
at the resonant frequency, the feedback circuit provides just enough gain to amplify 
the natural circuit noise to the level required, so this gain is controlled by the limiting 
mechanism. Away from the natural resonant frequency, the loop gain falls away; the 
higher the Q the more rapidly the gain falls off. This leads to two defect mechanisms: 
if a large signal close to resonance couples into the resonator, it will get amplified 
until it causes the limiter to reduce the gain. The noise content at the output will fall 
away, and the oscillator will have locked to the interferer. If an interferer is coupled 
in at a much lower level, then it will not trigger the limiting action; the shaped noise 
will still exist, but the spurious tone will interact with the oscillations and modulate 
the frequency. A side-tone will appear at the interferer frequency and a second one on 
the other side of the main tone — characteristic of EM modulation. Now the sensitivity 
to such an interferer depends on how close to the main tone the interferer is. If it is 
at the resonant frequency itself, the coupling only needs to be at circuit noise levels, 
the level required increases as the coupling tone is offset in frequency. The higher the 
resonator’s Q the higher the level of the interfering signal that can be tolerated. Since 
very small signals may pull an oscillator, the choice of architecture is critical. The 
aim, if an on-chip oscillator is to be included, will always be to ensure that spurious 
signals which could pull or lock the VCO are not present. 
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Abstract: Research into the potentials of CMOS technologies for RF applications 
has been growing enormously in the past few years. The trend towards deep sub-micron 
technologies allows operation frequencies of CMOS circuits above 1 GHz, which opens 
the way to integrated CMOS RF circuits. Several research groups have developed high 
performance down-converters, low phase noise voltage-controlled oscillators and dual- 
modulus pre-scalers in standard CMOS technologies. Research has already resulted 
in fully integrated receivers and VCO circuits with no external components, neither 
tuning nor trimming. Further research on low noise amplifiers, up-converters and syn- 
thesisers has recently resulted in fully integrated CMOS RF transceivers for DCS-1800 
applications. 
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4.1 INTRODUCTION 

A few years ago, the world of wireless communications and its applications started 
to grow rapidly. The driving force for this was the introduction of digital coding 
and digital signal processing in wireless communications. This digital revolution is 
driven by the development of high performance, low cost CMOS technologies which 
allow the integration of an enormous number of digital functions on a single die. This 
in turn allows the use of sophisticated modulation schemes, complex demodulation 
algorithms and high-quality error detection and correction systems, resulting in high- 
performance, lossless digital communication channels. Low cost and a low power 
consumption are the driving forces, and they make the analog front-end the bottle 
neck in future RF designs. Both low cost and low power are closely linked to the trend 
towards full integration. An ever higher level of integration renders significant space, 
cost and power reductions. Many different techniques to obtain a higher degree of 
integration for receivers, transmitters and synthesisers have been presented over the 
past years [1-3]. 

Parallel to the trend to further integration, there is the trend to the integration of 
RF circuitry in CMOS technologies. The mainstream use of CMOS technologies is 
the integration of digital circuitry. If possible, using CMOS technologies to integrate 
high-performance analog circuits has many benefits. The technology is cheap if used 
without any special adaptations towards analog design. Plain CMOS has the additional 
advantage that the performance gap between devices in BiCMOS, nMOS devices in 
deep sub-micron CMOS, and even NMOS devices in the same BiCMOS process is 
becoming smaller and smaller due to the much higher investments into the develop- 
ment of CMOS rather than bipolar technologies. Nowadays, NMOS devices have 
even higher transition frequencies (/,) than NPN devices. 

Although some research has been done in the past on the design of RF circuits 
in CMOS technologies [4], close attention has been given to it only in the past few 
years [5], Today, several research groups at universities and in industry are researching 
this topic [2, 3, 6, 7]. Since bipolar devices are inherently better than CMOS devices, 
RF CMOS is seen by some people as a possibility for only low performance systems 
with reduced specifications (like ISM) [8]. Some say that the CMOS processes need 
adaptations, like substrate etching under inductors. Others feel, however, that the 
benefits of RF CMOS can be much larger, and that it will be possible to use plain 
deep sub-micron CMOS for the full integration of transceivers for high performance 
applications like GSM, DECT and DCS 1800 [2,3]. 

In this chapter, some trends, limitations and problems in technologies for high fre- 
quency design are analysed. Second, the down-converter topologies and implemen- 
tation problems are reviewed. Third, the design and trends towards fully integrated 
low phase-noise PLL circuits are discussed. Finally, the design of fully integrated 
up-converters is addressed. 

4.2 TECHNOLOGY 

Due to the never ending progress in technology down-scaling and the requirement 
to achieve a higher degree of integration for DSP circuits, sub-micron technologies 
are nowadays considered standard CMOS technologies. The trend is even towards 
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deep sub-micron technologies, e.g. transistor lengths of 0. 1 pm and below. Transistors 
whose: / t is near 100 GHz have recently been built in 0.1 pm technologies [9,10]. 




Figure 4.1 Comparison Of ft (■) and / max (□). 



However, the speed increase of deep sub-micron technologies is reduced by the par- 
asitic capacitance of the transistor, meaning the gate -drain overlap capacitances and 
drain-bulk junction capacitances. This can clearly be seen in Fig. 4.1 in the compar- 
ison for different technologies of the f t and the /max, defined as the 3 dB point of a 
diode-connected transistor [11]. The J m ax is more important, because it reflects the 
speed limitation of a transistor in a practical configuration. As can be seen, the / t 
rapidly increases, but for real circuit designs (/max) die speed improvement is only 
moderate. 

Finally, recent integrations of CMOS RF circuits [5] have made it clear that the 
limiting factor will not be the technology, but the packaging. Since the RF signals 
have to come off the chip sooner or later, and since the RF antenna signal has to get 
into the chip, any parasitic PCB, packaging or bond-wire capacitance in combination 
with the ESD protection network and packaging pin capacitances will strongly affect 
and degrade the RF signal. 

4.3 FULLY INTEGRATED CMOS DOWN-CONVERTERS 

The heterodyne or IF receiver is the best known and most frequently used receiver 
topology. In the IF receiver, the wanted signal is down-converted to a relatively high 
intermediate frequency. A high-quality passive bandpass filter is used to prevent a 
mirror signal to be folded upon the wanted signal on the IF frequency. Very high 
performance can be achieved with the IF receiver topology, especially when several IF 
stages are used. The main problem of the IF receiver is the poor degree of integration 
that can be achieved because every stage requires going off-chip and using a discrete 
bandpass filter. This is both costly (the cost of the discrete filters and the high pin-count 
of the receiver chip) and power consuming (often the discrete filters have to be driven 
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by a 50 £2 signal source). Moreover, in CMOS RF circuit design input and output is 
becoming a serious problem already in the lower GHz frequency range. 

The homodyne or zero-IF receiver has been introduced as an alternative for the IF 
receiver. It can achieve a much higher degree of integration. The zero-IF receiver 
uses a direct, quadrature down-conversion of the wanted signal to the baseband. The 
wanted signal is its own mirror signal, and sufficient mirror signal suppression can 
therefore be achieved even with a limited quadrature accuracy. Theoretically, there 
is thus no discrete high-frequency bandpass filter required in the zero-IF receiver, al- 
lowing the realisation of a fully integrated receiver, especially if the down-conversion 
is performed in a single stage (e.g. directly from 900 MHz to the baseband) [6]. The 
problem of the zero-IF receiver is its poor performance compared to IF-receivers. The 
zero-IF receiver is intrinsically very sensitive to parasitic baseband signals like DC- 
offset voltages and crosstalk products caused by RF and LO self-mixing. These draw- 
backs have kept the zero-IF receiver from being used on a large scale in new wireless 
applications. The use of the zero-IF receiver has therefore been limited to low-perfor- 
mance applications like pagers and ISM, in which the coding can be scrambled so that 
a high-pass filter can be inserted to avoid the DC offset problems. Another application 
is the use as a second stage in a combined IF/zero-IF receiver topology [12,13]. It 
has, however, been shown that with the use of dynamic non-linear DC -correction algo- 
rithms implemented in the DSP, the zero-IF topology can be used for high performance 
applications like GSM and DECT [1,14]. 

In recent years, new receiver topologies like the low-IF receiver [2,15] were in- 
troduced for use in high-performance applications. The low-IF receiver performs a 
down-conversion from the antenna frequency directly down to, as the name already 
indicates, a low IF (i.e. in the range of a few 100 kHz) [2], Down-conversion is done 
in quadrature and the mirror signal suppression is performed at low frequency, af- 
ter down-conversion, in the DSP. The low-IF receiver is thus closely related with the 
zero-IF receiver. It can be fully integrated (it does not require an HF mirror signal 
suppression filter) and uses single stage direct down-conversion. The difference is that 
the low-IF receiver does not use baseband operation, resulting in a total immunity to 
parasitic baseband signals, resolving in this way the main disadvantage of the zero-IF 
receiver. The drawback is that the mirror signal is different from the wanted signal 
in the low-IF receiver topology, but by carefully choosing the IF frequency, an adja- 
cent channel with low signal levels can be selected for which the typical mirror signal 
suppression (i.e. a 3° phase accuracy) is sufficient. 

Fig. 4.2 shows the block diagram of a fully integrated quadrature down-converter 
realised in a 0.7 pm CMOS process [2]. The proposed down-converter does not require 
any external components, nor does it require any tuning or trimming. It uses a newly 
developed double-quadrature structure having a very high quadrature accuracy (less 
than 0.3° in a very large passband). The topology used for the down-converter is 
based on NMOS transistors in the linear region [2,5]. In combination with capacitors 
on virtual ground, only a low-frequency opamp is required. Using the MOS transistors 
in the linear region results in very high linearity for both the RF and the LO input. This 
is reflected in a very high third order intercept point (IP3) for the mixers, which is over 
+45 dBm. The advantages of such a high linearity on both inputs are that the mixer can 
handle a very high third order intermodulation-free dynamic range IMFDR3, resulting 
in no need for any kind of HF filtering. 
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Figure 4.2 A double-quadrature down-conversion mixer. 



4.4 THE SYNTHESISER 

The local oscillator is responsible for the correct frequency selection in the up- and 
down-converters. Since the frequency spectrum in modem wireless communication 
systems must be used as efficiently as possible, channels are placed together very 
closely. The signal level of the desired receive channel can be very small, whereas 
adjacent channels can have very large power levels. Therefore the phase noise specifi- 
cations for the LO signal are very hard to meet. 

Meanwhile, mobile communication means low power consumption, low cost and 
low weight. This implies that a completely integrated synthesiser is desirable, where 
“integrated” means a standard CMOS technology without any external components or 
processing steps. Usually, the LO is realised as a phase-locked loop. The very hard 
specs are reflected in the design of the Voltage-Controlled Oscillator (VCO) and the 
Dual-Modulus Pre-scaler (DMP). 

The phase noise of a VCO (L[Am)) is one of the most important specifications 
for telecommunication applications. If there is sideband phase noise at an offset A co 
equal to the channel spacing of the system, the down-converter will superimpose the 
adjacent channel onto the signal in the receive channel. As a result, this will reduce 
the signal to noise ratio (SNR) of the wanted signal. 

For the realisation of a gigahertz VCO in a sub-micron CMOS technology, two 
options exist: ring oscillators or oscillators based on the resonance frequency of an 
LC-tank. The inductor in this LC-tank can be implemented as an active inductor or a 
passive one. It has been shown that for ring oscillators as well as active LC-oscillators, 
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the phase noise is inversely related to the power consumption [16]. 

Ring oscillator [10]: L[Aa>] ~ kT ■ R ( ) with g m = — 

\A<j}/ R 

kT / w \2 

Active-LC [7]: L{ A<u) ~ - — ) with g m = 2cuC (4.1) 

2aiC \Aa)/ 

Therefore, the only viable way to obtain a low-power, low-phase-noise VCO is an 
LC-oscillator with a passive inductor. In this case, the phase noise changes proportion- 
ally with the power consumption: 

Passive-LC [7]: L{A(o) ~ kT ■ R with g m = R ■ (cuC) 2 (4.2) 

VAo)/ 

As could be expected, the limitation in this oscillator is the integrated passive induc- 
tor. Equation (4.2) shows that for low phase noise, the resistance R (i.e. the equivalent 
series resistance in the LC-loop) must be as small as possible. Since the resistance R 
will be dominated by the contribution of the inductor’s series resistance, the inductor 
design is critical. Three solutions exist: 

1. Spiral inductors on a silicon substrate usually suffer from high losses in this sub- 
strate, which limit the obtainable 0-factor. Recently, techniques have been de- 
veloped to etch this substrate away underneath the spiral coil in a post-processing 
step [17]. However, since this requires an extra etching step after normal processing 
of the ICs, this technique is not allowed for mass production. 




Figure 4.3 A 0.4 pm CMOS up-converter. 



2. For extremely low phase-noise requirements, the concept of bondwire inductors has 
been investigated. Since a bondwire has a parasitic inductance of approximately 
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1 nH/mm and a very low series resistance, very-high-^ inductors can be created. 
Bondwires are always available in IC technology and can therefore be regarded 
as being standard CMOS components. Two inductors, formed by four bondwires, 
can be combined in an enhanced LC-tank to allow a noise/power trade-off [16]. A 
micro-photograph of the VCO is shown in Fig. 4.3 [18]. The measured phase noise 
is as low as -115dBc/Hz at an offset frequency of 200 kHz from the 1.8-GHz 
carrier. The power consumption is only 8 mA at 3 V supply. Although chip-to-chip 
bonds are used in mass commercial products, they are not characterised on yield 
performance for mass production. Therefore, the industry is reluctant to accept this 
solution. 

3. The most elegant solution is the use of a spiral coil on a standard silicon substrate 
without any modifications. Bipolar implementations do not suffer from substrate 
losses, because they usually have a high-ohmic substrate [19]. Most sub-micron 
CMOS technologies use a highly doped substrate and have therefore large induced 
currents in the substrate which are responsible for the high losses [20]. The ef- 
fects present in these low-ohmic substrates can be investigated with finite-element 
simulations. This analysis leads to an optimised coil design, which is used in the 
spiral-inductor LC -oscillator shown in Fig 4.4. Only two metal layers are available, 
and the substrate is highly doped. With a power consumption as low as 6 mW, a 
phase noise of - 116dBc/Hz at 600kHz offset from the 1.8-GHz carrier was ob- 
tained [21], 

4.5 RF CMOS UP-CONVERTERS 

Until now, mainly CMOS down-conversion mixers have been reported in the open lit- 
erature. It is only recently that CMOS converters are presented and results are demon- 
strated [22], In classical bipolar transceiver implementations, the up- and down-con- 
verter mixers typically use the same four-quadrant topology. There are, however, some 
fundamental differences between up- and down-converters, which can be exploited to 
derive optimal dedicated mixer topologies. In a down-conversion topology, the two 
input signals are at a high frequency (e.g. 900 MHz for GSM systems) and the output 
signal is a low-frequency signal of at most a few MHz for low-IF or zero-IF receiver 
systems. This extra degree of freedom has been used in the design of a very successful 
down-converter-only CMOS mixer topology [5]. 

For up-conversion mixers, the situation is totally different. The high-frequency lo- 
cal oscillator (LO) and the low-frequency baseband (BB) input signal are multiplied to 
form a high-frequency output signal. All further signal processing has to be performed 
at high frequencies, which is very difficult and power consuming when using current 
sub-micron CMOS technologies. Furthermore, all unwanted signals like the intermod- 
ulation products and LO leakage have to be limited to a level below e.g. -30dB of the 
signal level. 

Many published CMOS mixer topologies are based on the traditional variable- 
transconductance multiplier with cross-coupled differential modulator stages. Since 
the operation of the classical bipolar cross-coupled differential modulator stages is 
based on the translinear behaviour of the bipolar transistor, the MOS counterpart can 
only be effectively used in the modulator or switching mode. Large LO signals have 
to be used to drive the gates, resulting in a huge LO feed-through. This is a problem 
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Figure 4.4 Micro-photograph of the hollow-spiral VCO. 



of CMOS down-converters as well; e.g. in [23] the output signal level is -23 dBm 
with a LO feed-through signal of -30 dBm, which represents a suppression of only 
7 dB. This gives rise to very severe problems in direct up-conversion topologies. The 
problems above can be overcome in CMOS by linearly modulating the current of a 
MOS mixing transistor biased in its linear region. For a gate voltage of Vi + Vi ni ,a 
drain voltage of V 2 + Hm 2 /2 and a source voltage of V 2 — Vi„ 2 /2, the current through 
the transistors is given by: 



IDS = P ■ (Him • Hin 2 ) + P‘(V\-Vj- V 2 ) • Vi„ 2 (4.3) 

When the LO signal is connected to the gate, and the baseband signal to Uj n2 , the 
current contains frequency components around the LO due to the first term, and com- 
ponents of the baseband signal due to the second term in equation (4.3), respectively. 
Based on this principle, a 1 GHz up-converter has been presented in a standard CMOS 
technology [22] (see Fig. 4.3). 

All unwanted measured signals are below -30dBc. For example, the mixer of [12] 
is implemented in a 0.7 pm CMOS technology which achieves an / max of only 6 GHz 
for a gate over-drive of 1 V or a gm/ 1 ratio of only 2. Typical bipolar technologies 
used for the implementation of 900 MHz fully integrated transceivers have cut-off 
frequencies of over 20GHz [12]. Due to the low gm/I of present sub-micron tech- 
nologies suitable for high frequency operation, the power consumption of CMOS pre- 
amplifiers will be up to 20 times higher than of bipolar ones. However, thanks to the 
rapid down-scaling of CMOS technologies, the present CMOS building block realiza- 
tions show that full CMOS transmitters with an acceptable power consumption are 
feasible in very deep sub-micron CMOS, but some considerable research has still to 
be performed. 
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4.6 FULLY INTEGRATED CMOS TRANSCEIVERS 

Combining all the techniques described above has recently resulted in the development 
of single-chip CMOS transceiver circuits [24]. In Fig. 4.5, a micro-photograph of the 
0.25 pm CMOS transceiver is presented. The chip does not require a single external 
component, nor does it require any tuning or trimming. The objective of this design is 
to develop a complete system for wireless communications at 1.8 GHz that can be built 
with a minimum of surrounding components: only an antenna, a duplexer, a power am- 
plifier and a baseband signal processing chip are needed. The high level of integration 
is achieved by using a low-IF topology for reception, a direct quadrature up-conver- 
sion topology for transmission, and an oscillator with on-chip integrated inductor. The 
presented chip has been designed for the DCS-1800 system, but the broadband nature 
of the LNA, the down-converter, the up-converter and the output amplifier makes the 
presented techniques equally suitable for use at other frequencies, e.g. for use in a 
DCS-1900 or a DECT system. The presented circuit consumes 240 mW from a 2.5 V 
supply and occupies a die area of 8.6mm 2 . 




Figure 4.5 A 0.25 pm CMOS transceiver. 



4.7 CONCLUSIONS 

The trend towards deep sub-micron technologies has resulted in the exploration of 
the possible use of CMOS technologies for the design of RF circuits by several re- 
search groups. Especially the development of new receiver topologies, such as low-IF 
topologies, in combination with highly linear down-converters, has opened the way to 
fully integrated down-converters with no external filters or components. The trends 
towards deep sub-micron technologies will allow to achieve those goals as long as the 
short-channel effects will not limit the performance concerning linearity and intermod- 
ulation problems. 

Fully integrated CMOS VCO circuits having high performance, low phase noise 
and low power drain were shown in this paper. In a first step, the use of post processing 
techniques has enabled the use of standard CMOS technologies, using bondwires as 
inductors. 

Today, low phase noise performance can be achieved with optimised integrated 
spiral inductors in standard CMOS technologies even without any post-processing, 
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tuning, trimming or external components. This opens the way towards fully integrated 
receiver circuits. 

Recently, CMOS up-converters with moderate output power were presented in the 
open literature. Again, thanks to the trends towards deep sub-micron, this opens the 
way towards fully integrated transceiver circuits in standard CMOS technologies. 
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Abstract: So far, CMOS has been shown to be capable of operating at RF fre- 
quencies, although the inadequacies of the device-level performance often have to be 
circumvented by innovations at the architectural level that tend to shift the burden to 
the circuit building blocks operating at lower frequencies. The RF front-end circuits 
presented in this paper show that excellent RF performance is feasible with 0.25 pm 
CMOS, even in terms of the requirements of the tried-and-true superheterodyne archi- 
tecture. Design for low noise and low current consumption targeted for GSM handsets 
has been given particular attention in this paper. Low-noise amplifiers with sub-2 dB 
noise figures, a double-balanced mixer with 12.6dB SSB NF, as well as sub- 25mA 
current consumption for the RF front-end (complete receiver), are among the main 
achievements. 

5.1 INTRODUCTION 

The quest for miniaturized cellular telephone handsets has resulted in the development 
of commercial transceiver ICs of increasing integration levels in the last decade. The 
more traditional RF front-ends comprising a single LNA or LNA-mixer combination. 
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though they still have a strong presence in the market, are expected to slowly give way 
to the more highly integrated front-ends that combine the RF mixer, or even the LNA, 
with the IF-strip and baseband demodulators in the receiver [1-5]. The dominant 
technology for commercial RF front-ends has so far been advanced bipolar junction 
transistor (BJT) technology. 

The last five years have also seen CMOS, traditionally confined to the digital and 
baseband part of radio transceivers, make successful in-roads into the RF sections. At 
the research level, highly integrated CMOS RF transceivers have been reported for 
wireless LAN and cordless telephones [6,7], that are highly innovative in terms of 
architecture design. For cellular applications, where weight, size, and stand-by power 
requirements of the handset are much more demanding and the radio network envi- 
ronment more complex (less restricted in terms of possible interfering signals) and 
hostile, CMOS implementations of RF front-ends have still fallen far short of BJT per- 
formance, especially in terms of current consumption [6,8]. Compared to the typical 
current consumption of 50 mA in highly integrated BJT receivers, CMOS receivers 
published so far typically consume 100 mA, which makes them an unattractive alter- 
native to BJT solutions because of the corresponding drop in stand-by time or increase 
in battery size and weight. 

In this work, we report a low-power CMOS RF front-end which is part of our 
on-going effort to develop a complete RF transceiver in deep-submicron CMOS, and 
which not only meets the more stringent requirements of cellular telephony, but is 
also competitive to BJT implementations in terms of current consumption. Instead of 
taking on simultaneously the difficulties of the development of both a novel architec- 
ture and a low power implementation of RF circuits using CMOS, we have opted for 
the path of first gaining experience in the latter. The receiver architecture for our RF 
front-end is therefore the well-established and widely used superheterodyne receiver. 
Since more than 70% of the power in a typical receiver is consumed by the building 
blocks that operate at RF frequencies, our initial work has concentrated on the LNA 
and mixer for the receiver, and the preamplifier for the transmitter, which we report 
in this paper. The IF-strip plus the baseband circuits will form another chip that will 
be the subject of another report [9]. The ultimate goal of this work is to combine the 
complete receiver and transmitter into a single IC that meets at least the performance 
required for small mobile stations (MS). 

The paper is organized as follows. Section 5.2 provides a general description of 
the transceiver chip architecture and the required performance. Design considerations 
of the LNA and mixer, including the many trade-offs involved, are presented in Sec- 
tions 5.3 and 5.4, before the transmitter preamp design is described in Section 5.5. 
Sections 5.6 and 5.7 conclude the paper with discussions of the achieved power con- 
sumption and measured performance, respectively. 

5.2 TRANSCEIVER ARCHITECTURE AND RELEVANT GSM 
SPECIFICATIONS [10] 

Receiver planning typically consists of trade-offs, such as the number of IFs, the exact 
frequency of each IF, and the distribution of gain, linearity, and NF to each block in 
the chain, etc. In addition to the minimum performance for type approval, many other 
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important parameters such as size, complexity, power consumption and cost must be 
carefully taken into consideration. 

In our case, the partners responsible for system design have specified the single-IF 
superhet architecture shown by the block diagram in Fig. 5.1. The relatively high IF of 
71 MFIz was primarily dictated by image-rejection requirements and the availability of 
low-cost commercial filters. Realizing high gain at such a high frequency had raised 
concerns over the power consumption and stability of the IF AGC. Those concerns 
have been addressed when an earlier test chip [1 1] we implemented showed that with a 
good design, a relatively low-power IF amplifier is feasible at 7 1 MHz. Since complete 
channel filtering at the 71 MHz IF is not possible without using high-quality filters, 
whose price is not justified for this application, intermodulation performance of the 
IF amp is critical, adding further complexity to its design. To limit these problems 
and achieve a feasible design, 32 dB of the total gain was allocated to baseband, and 
the maximum gain of the IF amplifier has been limited to 60 dB. Eighty decibels of 
AGC range is implemented in the IF amp which is digitally programmable from -20 
to +60 dB in 2 dB steps. 
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Figure 5.1 Transceiver block diagram — single-IF superhet receiver, direct up- 
conversion transmitter. 



The large dynamic range (—102 dBm to —15 dBm)of the input signal as specified 
in GSM standards, for which a BER < 10 -3 must be maintained, requires at least 
87 dB of AGC range for the receiver to avoid saturating the baseband A/D converters, 
in addition to the necessary linearity. Allowing for a 10-15 dB margin, the receiver 
needs to be designed with a 100 dB AGC range. Since the IF amplifier is program- 
mable over an 80 dB range, the additional 20 dB are implemented in the LNA with a 
bypass switch. 

The gains of the LNA and the RF mixer require another trade-off in receiver plan- 
ning. The main parameters to be taken into consideration are sensitivity, intermodu- 
lation and blocking performance, as well as power consumption. Using high gain in 
the LNA helps to reduce the noise figure by compensating the insertion loss of the in- 
terstage filter and scaling down the noise contribution of the mixer. That is, however, 
at the expense of higher power consumption and risk of early overloading of both the 
LNA output stage and of the mixer. 

The blocking signal levels that may overload the LNA are depicted in Fig. 5.2, as 
prescribed by GSM testing for type approval. At 20 MHz away from the MS receive 
bands, OdBm can be expected at the receiver front-end. After 25-30 dB attenuation 
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by a typical RF filter or duplexer, the blocking signal at the output of a 15 dB LNA 
can still be as high as -10 dBm. For 5-10 mA output current, the -1 dB compression 
point can not be expected to be much higher than -5 dBm in a 50 £2 environment at 
the output of the LNA. If we expect the LNA to operate with blocking signals some 
6 dB below its compression point, then its gain must be limited to 15 dB. Since 3-4 dB 
loss is expected to come from the interstage filter after the LNA, the noise power of 
the mixer is only divided by 16 before it is added to the input noise of the LNA. If the 
overall NF specification is tight, both the LNA and the mixer must also have low NF. 




Figure 5.2 Template of blocking signal levels specified by the GSM standard. 



In digital wireless communications, the noise figure of the receiver is a particularly 
critical parameter, because, at the boundary between very low and very high bit-error 
rate, the transition can be effected by just 1-2 dB of signal-to-noise ratio. A low 
noise figure enables satisfactory reception even for low levels of received signal power 
which, in turn, allows lower-power transmission from the base station and lower levels 
of interference in the radio environment in general. 

For GSM type approval, a reference sensitivity of -102 dBm is specified for small 
mobile stations. This means that for static additive white Gaussian noise (AWGN) 
channel conditions, a BER < 10~ 4 must be achieved when the input signal power P, 
is -102 dBm. For the GMSK modulation scheme used in GSM, an SNR of about 
9 dB is required to reach such a BER, assuming an optimum maximum-likelihood 
sequence estimation (MLSE) receiver. If a sub-optimal MS K- type detector is used, an 
SNR of 10 dB is required [12]. The simplicity of the MSK-type detector is of interest 
here as it makes the latter well suited for low power implementation, while offering 
essentially the same performance as MLSE, especially under conditions more realistic 
than a static AWGN channel. Bearing in mind that the (antenna) source noise power 
density at a temperature of 290 K is -174 dBm/Hz, a noise power P„ of -121 dBm 
results for a 200kHz channel. The required NF after the antenna can therefore be 
calculated in dB as NF = P, — P„ — SNR = 9dB. 

In order to attenuate blocking interferers an RF filter, or often a duplexer, is required 
between the antenna and the LNA. Its passband loss, however, translates directly into 
a serious NF degradation for the receiver. A typical duplexer [13], like the one which 
we intend to use, has an insertion loss of 3.2 dB for the Rx path (1.2 dB for the TX 
path). This leaves an overall NF of less than 6dB at the LNA input. 
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A typical RX image-reject interstage filter has an insertion loss of 3MdB (3.8 dB 
in [13]), which degrades the NF at the mixer input by the same amount. As a re- 
sult, even if the NFs of the LNA and mixer are as low as 2dB and 12dB (single 
sideband, SSB) respectively, the overall NF will still be 7.6 dB, leaving only a small 
margin for gain tolerances, insertion loss of the IF filter, noise contribution from the 
IF amplifier and the imperfect rejection of the LNA’s noise at the image frequency. To 
prevent excessive NF contributions from the IF-strlp, a gain of 10 dB is specified for 
the mixer. If we restrict the NF of the IF-strip (including the IF filter’ s insertion loss) to 
15 dB, the overall NF is just under 8 dB. The 15 dB limit can be met, for example, by 
an IF AGC with a NF of 8 dB [11] combined with an IF filter having 5.1 dB insertion 
loss [13], or a cheaper IF filter with 11.3 dB loss [13] could be tolerated with a better 
AGC design with a 4 dB NF, which is more difficult. Should the gain of the LNA drop 
to 14 dB due to component variations, the overall NF becomes 8.5 dB, which is still 
within specifications. The noise figures are therefore specified as 2dB and 12 dB SSB 
for the LNA and the mixer respectively, both demanding values even for BJT or GaAs 
technologies. 

To save the interstage filter and eliminate the adverse effect of its insertion loss on 
NF, image-reject mixers can be used as an alternative. By operating two mixers in 
quadrature the image is usually attenuated by 30-35 dB, which is sufficient to prevent 
noise at the image frequency from contributing to the total NF excessively. Flaving 
two mixers operating at 900 MFIz and a phase shifter in both the local oscillator and 
IF paths, however, results in power consumption roughly 3M times higher than that 
of one conventional mixer, which was considered unacceptable for the present design. 

Compared to the receiver, the transmitter requirements are relatively simple. The 
most difficult one is perhaps meeting the -36 dBm spurious signal emission limit. 

Without any IF or image frequency, a direct up-conversion transmitter is less likely 
to generate spurious signals. The filtering requirements to eliminate leakage are there- 
fore greatly relaxed. Power consumption, one of the disadvantages of direct conver- 
sion, is not as critical here, since it will be dominated by the poweramplifier (~ 5 W av- 
erage input power). The direct up-conversion architecture, shown in Fig. 5.1, is there- 
fore preferred. 

Since the local oscillator will be integrated on the same chip as the transmitter, 
oscillator pulling may be an issue. Using an offset local oscillator will solve this 
problem, at the expense of an extra external filter. 

The integrated part of the transmitter consists of a vector modulator driving a pre- 
amp via an external interstage SAW filter [13]. In the present front-end, only the trans- 
mitter preamp has been integrated to test the ability of CMOS to deliver the 2 mW of 
power which are typically required by commercial power amplifiers. Thanks to the 
constant envelope modulation, linearity is not too critical, and the preamp can be dri- 
ven at the compression point for better efficiency. The nominal gain of the preamp is 
23 dB. 

5.3 LOW-NOISE AMPLIFIER 

In addition to the gain and noise figure, as discussed in the previous section, the re- 
quirements put on the LNA include 50 G matching at both the input and output, small- 
signal linearity (IP3) and the large-signal linearity, which is described by the -1 dB 
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compression point. In addition, these performance parameters must be relatively in- 
sensitive to the variations of process parameters and values of passive components 
such as the matching inductor and capacitors [14]. 



5.3.1 Input stage 

The most important factor that determines our choice of the LNA configuration is the 
achievable noise figure. Although noise figures of 3 dB and better are usually quite 
easy to achieve given sufficient current, integrated LNAs do not often have NFs below 
2 dB even in GaAs and BJT technologies. This apparent barrier can be partially ex- 
plained by the definition of the noise figure itself, which results in a law of diminishing 
returns. 

Since the input-referred noise of a MOS transistor is given by 4ykT/g m , with 
y = 2 /3, the noise factor of a common gate (C-G) amplifier is given by 1 + y. The 
minimum NF achievable by a C-G amplifier is therefore 2.2 dB. With the additional 
noise contributions of pads, the substrate and the output stage, the achievable NF is 
closer to 3 dB, which is too high for the present application. 

For a common source configuration, as shown in Fig. 5.3, the real part of the input 
impedance required for 50 G matching is generated by inductive degeneration [14- 
16]. The matching network formed by L \ and Li and the gate capacitance C gs gives 
a voltage gain between the LNA input (marked RF) and the gate-source of transistor 
Ml which is equal to the quality factor Q of the matching network at resonance. 




vdd 



out 



Figure 5.3 Common-source LNA with transimpedance output stage. 



Q = 



l 

gmCOoLi 



(5.1) 



If Q is greater than one, this gain may help reduce the contribution of the thermal 
noise of Ml’s channel, 1%, and of those noise sources from the LNA’s output stage 
which contribute to the overall LNA noise figure. For example, the NF contribution of 
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/£ alone is given under matching conditions by 



NF = 1 + y 



coqL\ 

QRs 



1 + Y 



1 

Q 2 gmRs' 



(5.2) 



where y = 2 / 3 for long-channel devices. Even if the transconductance is set to only 
20 mS as in the case of the C-G configuration, the voltage gain of Q can still reduce 
the noise contribution from /£ to very low levels. Indeed, this well-defined noise 
contribution reduces to insignificant levels, even with only moderate values of Q and 
g m . The NF of common source LNAs, typically 1.5dB to 3 dB, (even if integrated 
in GaAs and BJT technologies) must be dominated by other less well-defined noise 
sources. This latter category of noise sources includes the contribution of substrate 
resistance through capacitances under bonding pads, gate-induced noise current [15] 
and the back-gating effect of the substrate resistance under the MOS transistor [17]. 

A more important reason to use Q > 1 to achieve some voltage gain is to reduce the 
current level required for a given overall transconductance of the input stage. Under 
matching conditions, and neglecting the parasitic capacitances associated with input 
pads, the ratio between the output current of the input stage and the input voltage of the 
LNA is exactly the inverse of L i ’s impedance, independent of the transconductance 
gm of Ml. 



I G m \ = ~^— (5.3) 

cuo L\ 

The overall voltage gain of the LNA is then given by the product of G m and the 
load (trans)impedance of the output stage. The latter impedance is constrained by the 
output matching requirement to about 50 £2. For a LNA gain of 15 dB, G m 120mS, 
so that at 1 GHz the value of L i lies around 1 nH. This happens to be the value of a 
typical bond- wire, so that no board-level inductor is necessary to implement L\. 

Once this overall G m or bond-wire impedance is determined, the transconductance 
g m of Ml is related to Q by Eq. (5.1). If we use the simple formula 

gm =j2nC' ox W-l/L 

to estimate the current consumption, and bear in mind one of the conditions for imped- 
ance matching, 



gmLi—Cg S R s (5.4) 

then we see that the bias current I is inversely proportional to Q. This is important for 
a CMOS LNA design, because without this voltage gain (i.e. of Q were unity), 10 mA 
would be needed for a G m of 120 mS even at a gate length of 0.25 pm While a high Q 
matching network allows the current consumption of the LNA input stage to be low, 
the level of reactance associated with the network inductors and capacitor becomes 
high. Although the nominal capacitive and inductive reactance should cancel each 
other if designed for 50 £2 matching, the same 10% deviation from the nominal values 
results in higher residual (uncanceled) reactance for higher Q. The variability of the 
input reflection coefficient Sll is therefore also worse. One of the serious effects of 
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the increase in Sll and deviation from 50 Q input resistance is the decrease of the RF 
filter’s performance [14]. 

Measurements of a commercial duplexer [13] in Fig. 5.4 show that |S11| = — 6dB 
can both increase the filter’s pass-band loss by 0.5 dB, causing a corresponding in- 
crease in the overall noise figure, and at the same time degrade the stop-band attenu- 
ation by as much as 6 dB at 20 MHz away from the passband, at frequencies where 
0 dBm blocking signals are expected. 





Figure 5.4 Measured duplexer characteristics. The thick curve is with exact 50 £2 
matching. The other four were measured with imperfect S1 1 : IS1 1 1 = -6 dB, phase of 
S1 1 =45°, 135°, 225° and 315°. a) Overall transfer function, b) close-up of passband. 



In our design, a fairly low quality factor of 2.5 is used for the input matching net- 
work, so that for the 10-20% tolerance expected of L\, Li and C gs , the variability 
of S 1 1 is still very low. On the other hand, the bias current of the first stage is reduced 
to 4 mA, which is sufficiently low in the context of the target current consumption of 
25-50 mA for the complete RF receiver. 

Without the cascode transistor M2, the influence of Ml’s Miller capacitance can 
add quite significantly to that of the pad capacitance and cause the design to deviate 
from the ideal situations described by Eqs. (5.1)— (5.4). The bias current of Ml, for 
example, increases from 4 mA to 6 mA. The cascode transistor, on the other hand, may 
contribute some of its own noise to the amplifier, because Ml’s output impedance is 
low due to the very short channel length and high current. 

Having scaled down the contribution of the thermal noise of Ml’s channel by the 
Q of the matching network, the design of the LNA input stage for low noise figure 
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must then concentrate on parasitic noise sources that are not as well-defined as /£. 
Characterizing the transistors for NF before using them for the LNA, as is often done 
in traditional microwave LNA designs, is beyond our resources, because this would 
entail fabricating arrays of transistors of anticipated range of dimensions and bias 
currents, with anticipated layout. Since such devices are not designed to match the 
50 ft test environment, ‘de-embedding’ the influence of pads and other test fixtures 
and measuring the intrinsic noise figure at 1 GHz is a rather large undertaking. The 
minimal noise figure measurements made by the CMOS process provider typically 
come from transistors which have a specific W/L ratio and layout and are biased at 
unrealistically high currents (40-50 mA), and therefore do not provide us with much 
information either. In the absence of accurate RF models of MOS transistors backed by 
reliable experimental verification, the design for low NF of our LNAs had to combine 
general care in noise minimization with some specific, controlled experiments that 
were conceived to extract some data on the relative importance of certain parasitic 
noise sources in the overall noise figure. 

One such source is the induced gate current [18,19]. This noise source is important 
at higher microwave frequencies. Recently, a detailed analysis has been published [15] 
which not only points out that induced gate noise current is also important for CMOS 
at the low GHz frequency range, but also proposed ways of optimizing the LNA de- 
sign parameters, such as the matching network’s Q, to achieve minimum noise figure. 
Unfortunately, the predicted optimal NF was not confirmed out by the experimental 
results in [15] and so the subject is still wide open. 

Another important source of noise in CMOS LNAs which was recently reported is 
the backgating effect of the resistive substrate under the transistor channel [17]. In the 
case of the 0.25 pm CMOS we used, the substrate resistivity has a fairly high value 
of 5 ft-cm, which could have an even higher influence. The experimental data given 
in [17], however, reported a NF difference of 1 dB between a layout with sufficient 
substrate contacts to ground and a layout with fewer contacts, at a high level of 6-7 dB 
overall noise figure. The important question for us was whether there will be much 
difference for LNAs which already have low noise figures. 

In the initial stage of our development, we had succeeded in realizing a 0.25 pm 
CMOS LNA, which we refer to as LNA-1, for which care was taken in pad design, 
fingered gate layout, etc., so that the measured noise figure was as low as 2dB. The 
substrate around the gate of Ml was already well grounded by rows of contacts sur- 
rounding the periphery of the 600 pm gate formed by eighty 7.5 pm by 0.25 pm fin- 
gers, as shown in Fig. 5.5a. The input stage ofLNA-1 did not use the cascode transistor 
M2, so that the bias current was 6 mA and the actual matching network Q was close 
to unity due the influence of Miller capacitance. 

To investigate the influence of substrate resistance on achievable NF, we integrated 
a second LNA, LNA-2, that is nominally identical to as LNA-1, except that the layout 
of the gate of Ml was changed to allow more substrate contacts to be placed, as shown 
in Fig. 5.5b. The original LNA-1 was integrated again in the same run as a reference 
to avoid confusing NF variation from run to run with the NF improvement due to 
difference in layout. 

Realizing that Ml’s Miller capacitance is causing a significant deviation of the 
matching network gain from that given in (5.1), which results in 6 mA instead of 4 mA 
in the input stage, we also introduced the cascode transistor M2 at this stage. In 




58 



CIRCUITS AND SYSTEMS FOR WI RE LESS COMMUNICATIONS 




i 

a) b) 



Figure 5.5 Layout of transistor Ml : a) LNA-1 , b) LNA-2. 



doing so, the effective matching network Q increases from 1 to 2 which allows the 
bias current to be reduced to 4 mA. This circuit has been implemented as LNA-3 in 
the same run as LNA-1 and LNA-2. Although the change in Q as a result of going 
from a non-cascode to a cascode configuration is not exactly the same as designing 
for different matching network 0s for the same LNA configuration, we believe that if 
the matching network Q indeed has as strong an influence on the LNA’s NF as [15] 
seems to suggest, then we should see some evidence of this in comparisons of LNA-2 
and LNA-3. The gates of Ml in these two amplifiers have exactly the same layout. All 
three amplifiers should have noise figures less than or equal to 2dB. 

5.3.2 Output stage 

The overalltransconductance G m of the input stage is on the order of 100-200 mS. An 
effective load impedance between 50 and 100 £2 is therefore needed to realize 15 dB 
gain. In today’s CMOS technologies, however, the relatively high level of parasitic 
capacitance (C 2 ) tends to limit the achievable cut-off frequency. To make the load 
impedance broadband, shunt feedback in the output stage transimpedance amplifier 
in Fig. 5.3 is one of the most commonly used methods [1,20]. The loop gain of the 
shunt feedback, approximately given by gm^RL, scales down the input resistance of 
the output stage so that the parasitic capacitance C 2 does not cut off the amplifier’s 
frequency response as early as without feedback. The impedance Ri is the parallel 
combination of the output impedance of M3, the bias resistance R 3 and the 50 £2 input 
impedance of the interstage filter. 

If the loop gain is sufficiently large, then the transimpedance of the output stage in 
Fig. 5.3 is approximately given by the feedback resistance /? 2 , and the overall LNA 
gain is simply G m ft 2 - In practice, the limits of M3’s transconductance g m 3 due to 
current, IP3 and gate capacitance considerations imply that only moderate loop gain 
is achievable with an Rl that is dominated by the 50 £2 load. The LNA gain must be 
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designed on the basis of the more accurate design equation: 

= -V- f , 1 2 - + R t 11^1 WZdli] -w~r(i+8 m 3R2) , (5.5) 

«oL| Ll+g m3 «£, J^£ + /?2 

where Z^n is the output impedance of Ml or the M1-M2 cascode. 

The higher parasitics associated with PMOS transistors make them unsuitable for 
biasing purposes and resistors R i and /? 3 are used instead. The lack of voltage head- 
room combined with high bias currents constrain the values of R i and /? 3 to a couple 
of hundred ohms. In addition to (5.5), the choice of C?i,/? 2 ,* 3 as well as g m3 are 
constrained by two other important requirements: realizing a 50 £2 output resistance 
and providing a sufficiently high - 1 dB compression point (CP). The CP is limited by 
the maximum output voltage for a given bias current / 3 , 

C* = h-Z3 = h-l(R2 + Rl\\Zdl2)\\RL\ = h- [(*2 + *t 1 1 Zrfi2)P3 1 1 Zrf3 1 1 /*»] , 

(5.6) 



where Z 3 describes the impedance seen by the drain terminal of M3. Due to the 50 £2 
input impedance of the interstage filter, R$o, |Z 3 |is less than 50 £2 .This means that for 
a CP requirement of 0 dBm (300 mVp) the bias current / 3 must be no less than 6 mA. 
In practice R\ and (? 3 can not be very high because of both the voltage headroom and 
the need to implement the 50 £2 output impedance for the LNA. In the worst case, |Z 3 1 
can be as low as 25 £2, and twice as much current is needed. The constraint on RuR2 
and Ri in terms of 50 £2 output impedance can be expressed as 



Z 0 = 



/ *2 + *ll|Z<m \ 
\1 +gm3(tfll|Z<m)/ 



p 3 ||z d3 =50£2, 



(5.7) 



where it can be seen that in the absence of M3’s transconductance g m3 , the 50 £2 
output impedance would have to be implemented completely by R j, /? 2 and /? 3 ,in 
which case |Z 3 | in (5.6) would be as low as 25 £2. The design of the output stage is 
therefore primarily a trade-off of resistance values to satisfy (5.5)— (5.7) and minimize 
the required output current, while ensuring that M3’s gate overdrive is sufficiently high 
to maintain a high IP3. The latter requirement sets a limit to M3’s gate width. 

For LNA-1, a higher CP was sought, which led to higher output current. In the 
absence of the cascode, the current of the input stage is also higher, resulting in lower 
values of R i, /?2 and 2? 3 . This leads to an output bias current of 12 mA which gave us 
a measured output CP of -4 dBm. This output stage design has been left unchanged 
for LNA-2 and LNA-3, so that only one thing is different between the two successive 
designs. Further optimization of the output stage, which took advantage of the lower 
bias current of the input stage, allows the levels of R\, Ri and /? 3 to be somewhat 
higher than before, so that only half as much bias current is needed for the output 
stage, with only a slight drop in output compression point. The increase of resistance 
levels is also expected to result in lower equivalent noise referred to the drain of M2, so 
that the overall noise figure will be slightly lower. This design is referred to as LNA-4. 
The differences between the four designs and the key parameters of each design are 
summarized in Tab. 5.1. 




60 CIRCUITS AND SYSTEMS FOR WIRELESS COMMUNICATIONS 



Table 5.1 Summary of the implemented LNAs and their key design parameters. 





LNA-1 


LNA-2 


LNA-3 


LNA-4 


/(Ml) [mA] 


6 


6 


4 


4 


/(M3) [mA] 


12 


12 


12 


6 


Ml [pm] 


600 


600 


600 


600 


M2 [pm] 


- 


- 


600 


600 


M3 [pm] 


100 


100 


100 


300 


R1 [tt] 


235 


235 


400 


400 


R2 [ft] 


120 


120 


120 


120 


R3 m 


105 


105 


105 


255 


Sub. contact 


Fig. 5.5a 


Fig. 5.5b 


Fig. 5.5b 


Fig. 5.5b 



5.4 SINGLE AND DOUBLE-BALANCED MIXER 

The chief requirements of an RF mixer in a superhet receiver such as ours are 5 Off 
matching to the interstage filter, low noise figure, high IP3, and a moderate gain to 
reduce the noise contribution of subsequent stages to the receiver front-end. To satisfy 
all of the requirements, single -balanced (SBM) or double-balanced (DBM) mixers 
based on the Gilbert cell are practically left as the only choice, as evidenced by the 
fact that they are found in virtually every BJT RF front-end IC. 



if if if if 




Figure 5.6 Schematic diagram of a) single-balanced mixer, b) double-balanced mixer. 



In a single or double-balanced mixer (c.f. Fig. 5.6) the transconductor (£ m / im- 
plemented by the (input) RF transistor(s) is followed by commutating switches that 
perform the function of frequency translation. The effective conversion transconduc- 
tance of the mixer is g mi - /n for the single balanced configuration and 2 g m i /n for the 
double-balanced configuration, assuming ideal switching. Thus even in the ideal case 
the switches introduce a loss of \jn (-9.9 dB) for the SBM case and 2/7r (—3.9 dB) 
for the DBM case. With sinusoidal LO signals the switching is much less than ideal, 
and the loss is more likely - 1 2 dB for the SBM case and -6 dB for the DBM case. 
Because of this loss, the influence of noise current introduced by the switching transis- 
tors (during the time when each pair of switches remain in the on-state simultaneously, 
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for example), exceeds the noise of the RF input transistors). Combined with the inher- 
ent 3 dB NF degradation due to aliasing of the noise in the image sideband, the noise 
figure is therefore usually rather high for Gilbert mixers, 15 dB, for example, for BJT 
implementations. 

Because the expected NF is high for mixers, the input transistors’ contribution of 
2 dB or 3 dB to the overall NF is therefore not very critical. This justifies the use of 
the common gate configuration for the input stage, which can be designed to provide 
a resistive 50 £2 input matching to the filter without any external components. The g m 
of the input transistors is therefore 20 mS for the SBM case and 40 mS for the DBM 
case. 

In cellular applications, neighboring users often generate strong interfering signals 
in channels adjacent to one’s own. Since such interferers cannot be filtered by the 
RF or interstage filters and will be amplified by the LNA, the small-signal linearity 
(IP3) of the mixer must be sufficiently high to prevent intermodulation from degrading 
the bit-error rate. The type-approval requirement for GSM stipulates that the BER 
for reference sensitivity must be met in the presence of two -49 dBm interferers at 
800kHz and 1600kHz away from the desired signal, respectively, when the latter’s 
power is -99 dBm. The mixer’s IP3 requirement is directly linked to the combined 
receiver front-end gain, Ayp, before the mixer. If we assume again that 10 dB SNR 
is required to achieve the desired BER, then the required IP3 (in dBm) is bounded by 
the following inequality: 

-99 + Ai/f — 3(— 49 + Avf) — 9 + 2IP3> 10 (5.8) 

or IP3 > Ay f ~ 14.5 . (5.9) 

Considering that within the passband of each filter the gain of the signal could be 
different by 1 dB from that of the interferers due to passband ripple, the combined gain 
A y F could be 2-3 dB worse than the nominal value. The latter, assuming -3 dB pass- 
band loss for each filter and 15 dB for the LNA, is 9 dB. Thus the required IP3, assum- 
ing worst-case gain combinations for the signal and interference, should be -3 dBm 
to -2 dBm. Taking a 3 dB margin, the required IP3 for a small mobile station is in 
the order of 0 dBm. For a standard mobile station, the desired signal is 2 dB weaker 
(-101 dBm) and the interferences are 4dB (-45 dBm) stronger, so that the IP3 re- 
quirement can be as high as 7 dBm. 

In BJT implementations, such IP3s can only be achieved with some linearization 
technique such as emitter degeneration. In a CMOS mixer, as long as the transistor has 
sufficient overdrive so that it stays in strong inversion, achieving more than 0 dBm IP3 
is not a problem [14]. For a given input transconductance as required by impedance 
matching, however, it imposes a minimum-current requirement. In our designs, 4 mA 
was required for the single-balanced mixer and 6 mA for the double-balanced mixer, 
so that the transistor is biased halfway between weak and strong inversion. 

Although the single-balanced mixer consumes less power, it provides less conver- 
sion transconductance, lower linearity and higher noise figure. The local oscillator 
signal is not suppressed properly by the mixer, so that it feeds through to the output. 
In a superhet receiver, however, this LO feedthrough is suppressed by the IF filter. 
Since a typical 7 1 MHz IF filter requires a matching load resistor on each port to its 
330 £2 characteristic impedance, the RF mixer sees a 165 £2 load at its output. Even 
in the ideal case the conversion transconductance is 6.4 mS for the SBM and 12.7 mS 
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for the DBM, which is not enough to provide 10 dB conversion gain. A simple LC 
network is therefore required to transform the load impedance to the desired level in 
any case, and the 6 dB difference between the conversion g m s of single and double- 
balanced mixers is not critical as far as gain is concerned. 

Because of the higher (by 6 dB) loss due to the switches, the SBM is expected to 
have a higher NF than the DBM. Assuming that the noise current of the switches dom- 
inates the mixer noise figure, and taking into consideration that the DBM has twice 
as many switches which therefore generate twice the noise power, the input-referred 
noise figure of SBM is expected to be about 3 dB higher than the DBM. 

Although we favor the DBM for its expected higher performance, we have also 
implemented the SBM to check if the difference in NF is indeed as big as expected to 
justify the 50% higher current of the DBM. 

The design of the switches in the mixer is primarily a trade-off between having 
high W/L ratios to reduce the overlapping period in which both switches in each pair 
remain on, having low thermal noise current during this period, and limiting the pole 
introduced by the switches into the RF path outside the period, for a given current. 
In our design, the final switch sizes are similar to those of the input RF transistors, 
300 pm by 0.25 pm. 

5.5 THE TRANSMITTER PREAMPLIFIER 

Most modem GSM power amplifiers (PA) are constructed in a hybrid technology, us- 
ing bipolar or power-MOS transistors. They require 2 mW of driving power and a sup- 
ply voltage of 4.8 V (sometimes 6V). The mandatory power control is implemented 
in the PA, and the output power can be varied by adjusting the DC voltage applied to 
a control pin. The peak power of a class-4 PA is as high as 2 W. Although its average 
transmitted power can be much lower because the average distance between the mobile 
and the base station is much less than the maximum radius of the cells in the network, 
the average power consumption of a mobile station is still high, judging by the low 
talk time advertised for current commercial handsets. Given that power consumption 
is dominated by the PA, the power consumption of the preamp and modulator is not as 
critical as in the receiver case. Flowever, attention is paid to low-power design in the 
preamp so that the design will also be suitable for mobile stations with lower classes 
of PAs. 

In this work, the transmit SAW filter required to remove the harmonics of the carrier 
and out of band noise is placed between the up-conversion mixers and the preamplifier. 
Since the loss of the filter precedes the preamp, the preamp is only required to deliver 
3 dBm power to the PA instead of 7.5 dBm. This will ease the problems of delivering a 
substantial amount ofpower using CMOS, including efficiency and substrate coupling. 

Since the efficiency of the preamp is not as critical as the PA, a class-C approach 
requiring another expensive and lossy external filter before the PA is neither necessary 
nor does it save power. Due to the lack of good PMOS transistors, a class-B (push- 
pull) approach is also unavailable. If the preamp output harmonics that could degrade 
the PA efficiency are kept low, a class-A amplifier can drive the PA without the external 
filter. The primary requirements for our preamp, a two-stage class-A implementation 
as shown in Fig. 5.7, are thus 50 matching at both the input and the output, a high 
gain of 23 dB, and a high power output (for CMOS) of 3 dBm with sufficient linearity. 
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The first stage, comprising Ml, its biasing and the matching network, is similar 
to the input stage of the LNA, therefore the same design equations hold. The main 
difference here is the relatively high input level of -18 dBm. Since IM3 is inversely 
proportional to the overdrive of a MOS transistor, a relatively small transistor, biased 
at 1 1 mA, has been used for Ml . Because of the relatively large overdrive of Ml that 
leaves little voltage headroom under a 2.5 V supply, no cascode transistor has been 
used in this design to ensure that Ml stays in saturation. 

The second stage is a transimpedance amplifier, biased at 16mA, as required to 
deliver 2 mW into the 50 £2 load with sufficient linearity. The relationship between the 
required output compression point and the required bias current is similar to that of 
the LNA. Unlike the LNA, a resistive load is ruled out by bias voltage considerations, 
otherwise *3 would have to be approximately 60 £2, which would make high gain 
very difficult to achieve. Instead we use inductive loading, and the RF choke L 3 
resonates with the drain capacitance of M2 at 900 MHz so that the path to Vdd appears 
to be open, and the formula for the output compression point (5.6) and the output 
impedance (5.7) can be used with R 3 set to infinity. In fact, £3 and the coupling 
capacitance C 2 could also be used as an impedance matching network at the same 
time, so that the impedance seen by the drain of M2 could be higher. That would 
allow higher power to be delivered for a given current. Due to the low supply voltage 
(2.5 V) of the 0.25 pm CMOS technology, this option cannot be used in our design, 
because impedance scaling by the matching network also scales the voltage swing of 
the drain of M2. Another way of achieving higher output power in traditional preamp 
designs is to combine such a matching network with biasing the drain of M2 at Vdd, 
which doubles the allowable voltage swing. Since the margin between the breakdown 
voltage and the actual supply voltage is small compared to the output swing in our 
0.25 pm CMOS case, this option (which entails swinging the drain of M2 substantially 
above Vdd) was also ruled out on reliability grounds. To bias M2 in the mid-supply, 
resistor R 3 of approx. 60 £2 is inserted before £3 and AC-bypassed by C3 as shown. 
At DC, R->, also helps to stabilize the bias condition through R7. 

In many bipolar transceivers [3,5] a buffer capable of delivering 2 mW into 50 £2 
has been implemented, frequently as an emitter follower biased at about 10mA to 
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achieve high IP3. Implementing a 20 mS (50ft)bufferin0.25 pm CMOS, however, 
is not possible because of the large voltage swing required at the gate. Delivering 
2 rnW to 50 ft corresponds to having a voltage swing of 900 mV peak-to-peak at 
the transistor’s source and 1.8V peat-to-peak at its gate. After adding the threshold 
voltage (400 mV) and some overdrive (200 mV) we have a peak voltage of 2.4 V, which 
is already higher than the specified minimum supply voltage. To reduce the swing, a 
lower impedance buffer followed by a matching network could be used, but this would 
result in an excessive increase in current consumption. 

In our class-A implementation, the total current consumption is only 27 mA. Such 
a current level, required by linearity, is insignificant compared to that of the final PA. 

5.6 POWER CONSUMPTION 

Before describing the experimental results, it is perhaps appropriate to briefly revisit 
the subject of power consumption of the transceiver. Earlier we stated that for CMOS 
technologies to be seriously considered as a contender for RF front-ends in cellular 
applications and as a competitor to BIT technologies, one must a ) prove that the 
complete set of performance specifications can be met and b ) do so with a cunent 
consumption comparable to today’s commercial solutions. In this section we wish to 
highlight the current consumption aspect. 

Low power consumption is most important in the receive mode, because it defines 
the stand-by time. In transmit mode power consumption will be dominated by the PA. 
As the chief reason to purchase a mobile phone shifts from business use to private use, 
the toll charges serve as a natural barrier to reduce the occurrences of private users 
overrunning the, say, four hours of talk time possible on a single battery charge. Busi- 
ness and private users alike, however, may find a 40 hour or even a 72 hour stand-by 
time inconvenient. 

The designs reported here show that by combining LNA-4 and the double-balanced 
mixer, the nominal supply current of the front-end chip can be as low as 16 mA. On- 
going design and experimental work on the remaining part of the receiver gives us 
good reasons to believe that the IF-strip plus the demodulator will consume no more 
than 6 mA, which shows that the front-end is indeed the dominant consumer of power 
(over 70%) of the receiver. Even if we add the VCO and the prescaler, which domi- 
nate the current consumption in a synthesizer (3 mA in Rx mode), the total nominal 
current consumption will only be 25 mA, i.e. 60 mW power consumption. This com- 
pares very favorably with existing chipsets, such as those reported in [2] (51 mA in 
the paper, 46 mA total in the datasheet), [3] (57 mA), and even [4] (33 mA), for exam- 
ple. The last reference derives its high performance from an advanced 25 GHz bipolar 
technology. 

5.7 MEASURED RESULTS 

All measurements were performed using bare chips assembled on a PCB with all the 
needed external SMD components. The dies were directly glued to the PCB ground 
plane which provides an excellent ground connection to the chip substrate. Some 
care was taken with the length of the bondwire to the source of Ml in each LNA 
and the preamplifier so that correct matching is achieved. No external trimming was 
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performed. The supply voltage was set to its nominal value of 2.5 V. Among the chips 
measured, one is a combination of LNA-1, the SBM mixer and an early version of 
the IF-AGC amplifier [11]. Fig. 5.8 shows its micrograph. The measured parameters 
of the implemented LNAs, mixers, the transmit preamplifier as well as the particular 
receiver front-end are summarized in Tab. 5.2. 




Figure 5.8 Photomicrograph of front-end 1C. 



Table 5.2 Summary of measured front-end circuit performances. 





LNA-1 


LNA-2 


LNAS 


LNAS 


SBM 


DBM 


TX-PRE 


RX-FE 


Gain [dB] 


13.8 


14.3 


14.3 


16.4 






21.1 


85 max 


Conv. g m [mS] 










4.9 


8.5 






SSB NF[dB] 


2.0 


1.74 


1.97 


1.6 


15.9 


12.6 






iIP3 [dBm] 


-2.0 


-2.8 




-7.3 




9.0 






iCP [dBm] 




-17 




-20 


-4 


-4 


-14.6 




SI 1 [dB] 


-5.6 


-6.8 


-4.0 


-8.0 


<-15 


-11 


-6.6 




S22[dB] 


-8 


-22 


-11 


-12 






-8.5 




S12[dB] 


<-30 


< -30 


< -40 


< -40 










Idd [mA] 


18.3 


18.1 


16.6 


10.8 


3.5 


6.0 


27.2 


25 



For the low-noise amplifiers, we are particularly interested in the noise figures, 
as well as the influence of the various refinements on the NF in our controlled ex- 
periments. The reference amplifier LNA-1 achieves again a low NF of 2dB, while 
providing the expected gain, input and output impedances and excellent linearity. The 
fact that a 2 dB NF has been achieved over consecutive runs shows that the design is 
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suitable for manufacturing. With merely an increased number of substrate contacts, 
(of a transistor that already has many contacts,) LNA-2 shows a significant improve- 
ment in noise figure, to 1.74 dB. This demonstrates the significant contribution of 
back-gate resistance to the overall NF, at least for the 5 fi-cm substrate of the technol- 
ogy we used. The main influence of the cascode transistor in the input stage (LNA-3) 
is a 30% reduction in current consumption in the first stage due to the higher Q and 
lower g m needed, and a 10 dB improvement in reverse isolation. The price paid is a 
slight increase of the noise figure which we believe to be due to the additional con- 
tribution of M2. No rapid improvement of the NF, as predicted for the particular 
0.35 pm LNA in [15] for a similar change of Q , is observed here. The last amplifier, 
LNA-4, achieves the best performance trade-off for our application in terms of gain, 
noise figure and power consumption, thanks to a better design of the output stage. The 
measured output compression point (-5 dBm oCP) has only decreased slightly (1 dB) 
from that of the other 3 LNAs despite saving half of the output stage current. A very 
low noise figure of 1.6 dB is measured, which is near the optimum noise match. This 
is the best result reported to date for an integrated CMOS implementation. In Fig. 5.9, 
starting with the minimum noise figure close to the center of the Smith chart, con- 
stant NF and gain circles have been measured in terms of the source impedance seen 
from the noise-matched LNA. Thanks to the low - Q design, these circles are big and 
lwidely separated for small increments of gain and NF. This means that variations the 
of matching network impedance due to component tolerances in production will not 
cause drastic degradation of gain and NF. Indeed, in 50 £2 power-matched conditions, 
the measurements of LNA-4 in Fig. 5.10 show that 15 dB gain, -8 dB Sll and 1.9 dB 
NF can be achieved simultaneously. Due to the effect of the bond pad, the protection 
diode and the PCB parasitic capacitances at both sides of Li, the actual 50 Q power- 
match conditions of the LNA are slightly different from the ideal equations (including 
(5.2) in this paper) normally described in recent publications [14-16]. 

Turning our attention to linearity, Fig. 5.11 shows the measured iCP and iIP3 of 
LNA-2 and LNA-4. High IP3 values far exceeding requirements have been measured, 
confirming the intrinsic high linearity of MOS devices. The measured compression 
points meet the requirements of the blocking test and are 3^1 dB below the limit set 
by the output current, as expected. 

The measurements of the SB and DB mixers have been performed using -1.5 dBm 
of LO power with 50 £2 terminations at the LO ports. The measured conversion g m s 
are 4.9 mS and 8.5 mS for the SB and the DB mixer, respectively. Due to imperfect 
switching, they are about 30% lower than the theoretical maximum, which was ex- 
pected. Although the measured NF of 15.9dB for the SB mixer is higher than we 
would like it to be, it is in line with most reported mixers with gain at this frequency, 
including BJT implementations [1,3]. The measured 12.6 dB SSB NF of the DB mixer, 
on the other hand, is one of the best reported to date for fully integrated mixers, and 
facilitates the use of RF and interstage filters with moderate specs (and hence price) 
in superhet GSM receivers. The Sll measurements are shown in Fig. 5.12. The com- 
mon gate input guarantees excellent input matching with a broadband characteristic 
for both mixers. Indeed, the measured SI 1 is limited by the balun needed to match the 
DB mixer, but nevertheless stays under -10 dB at all frequencies below 1.2 GHz. The 
Sll degradation at higher frequencies is due to the increasing influence of parasitic 
capacitances. Fig. 5.13 shows that the measured iIP3 is as high as 9 dBm, and iCP 




GSM TRANSCEIVER FRONT-END CIRCUITS IN 0.25 |jm CMOS 




Figure 5.10 Gain, NF and S1 1 of impedance-matched LNA-4. 



-4 dBm, for the DB mixer, which thus meets the specs outlined earlier by a comfort- 
able margin. This has been achieved, without source degeneration, by the right choice 
of the g m /I ratio of the CMOS devices, and by differential implementation. 

The measurements of the transmitting preamplifier show that high gain, adequate in- 
put and output impedances as well as good linearity can be achieved even if a relatively 
high amount of power must be delivered. Fig. 5.14 shows the measured frequency re- 
sponse of the TX preamplifier. The measured iCP of the preamp is -14.6 dBm, which 
corresponds to +5.8 dBm oCP. This translates into more than 3.5 mW of power deliv- 
ered to the 50 £2 load at compression. 
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Figure 5.11 Measured, input-referred third order intercept point ilP3 and -1 dB com- 
pression point iCP for LNA-2 and LNA-4, respectively. 
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Figure 5.12 Measured S1 1 for single-balanced mixer and double-balanced mixer. 



5.8 CONCLUSIONS 

Power consumption is more important than the cost of chips in today’s cellular phones. 
To prove that CMOS is feasible and competitive with BJTs for implementing the RF 
front-end of a GSM receiver, circuits operating at radio frequencies must show com- 
petitive current consumption while meeting all specs required for type approval. Type 
approval requirements have been discussed and translated into circuit specs where 
necessary, and measurements of the front-end integrated circuits reported in this paper 
show that they are indeed met. We have shown that current consumption lower than 
today’s BJT implementations is indeed feasible by carefully optimizing the front-end 
design. Excellent noise figures are also shown to be reproducibly obtained for CMOS 
LNAs and mixers. Controlled experiments also confirm that backgating by substrate 
resistance can degrade the NF, and as many contacts as possible should be used to 
ground the substrate. 
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Figure 5.13 Measured, input-referred third order intercept point iiP3 and -1 dB com- 
pression point iCP, respectively, of the double-balanced mixer. 
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Figure 5.14 Measured frequency response of the TX preamplifier. 
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6.1 INTRODUCTION TO THE FOLLOWING PAPERS 

To newcomers to the field of RF front-end design for wireless communications, the 
combination of rapid change in commercial standards and requirements, the choice 
of system architectures and frequency plans, and their relation to the performance of 
individual radio frequency circuits can be quite bewildering. The four contributions 
included in this chapter address these issues from different perspectives in varying 
degrees of detail. 

The first two contributions come from academia and are biased towards basic the- 
ory. The second two contributions come from industry, providing more insight into the 
current state-of-the-art in wireless transceivers, and the overall system requirements in 
terms of cost, size (volume), and performance. 

Qiuting Huang focuses on the fundamental issue of phase noise in RF oscillators, 
which is particularly important for wireless transceivers as it strongly affects the rel- 
ative independence between different radio channels and services that transmit and 
receive signals of varying strengths simultaneously. In addition to avoiding unsound 
assumptions, the analyses of oscillation amplitude, oscillator response to interfering 
signals, and the identification of white noise sources have all been backed up with 
controlled experiments. 

Behzad Razavi exposes the fundamental issues of frequency planning in transceiver 
design, as well as recent trends in highly integrated RF circuit design, in the context 
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of a GSM/DCS dual-mode transceiver. The architecture is designed for a high level of 
integration, containing neither external image -reject filters nor IF filters. Quadrature 
mixers are therefore frequently used, and the LO signal is placed halfway between the 
GSM band and the DCS band to save hardware. In the transmitter part, the tutorial 
on GMSK modulation and influence of nonlinearity on the transmitted spectrum may 
be a very useful starting material to read for people new to the field. The emphasis in 
implementation is on CMOS, with extensive use of on-chip spiral inductors. 

Petteri Alinikula shares with us the challenges of future wireless terminals from the 
viewpoint of one of the top three leading commercial companies in the market. Wide- 
band CDMA transceivers appear to be the next target for most, especially in light of 
the recent agreement between Ericsson and Qualcomm over IPR issues. Linear power 
amplifiers, a sharp increase in DSP computing power, and multi-standard terminals are 
identified as major challenges facing designers of next-generation terminals. The au- 
thor also shares with us his view of the evolution in state-of-the-art terminals, in terms 
of relative physical volume, component count and cost breakdown. Many readers will 
find the corresponding figures interesting. Also interesting is a concrete example of a 
direct-conversion architecture being used to save discretes and filters. 

Stephan Heinen and Stephan fferzinger take us back to the GSM transceiver again, 
from the perspective of industry. They echo the view from Alinikula that overall sys- 
tem optimization is more important than the focus on a particular technology, the 
saving of a particular filter or the performance of a particular building block. In the 
discussion of GSM transmitters, the general requirements by type approval are pre- 
sented in detail, as are the effects of nonlinearities on the transmitted spectrum. This 
again gives a newcomer to the field a nice introduction, from an angle differing slightly 
from Razavi’s. Direct up-conversion, IF up-conversion and modulation-loop VCO ar- 
chitectures are discussed in terms of their relative merits and disadvantages, and the 
modulation loop is identified as the transmitter architecture of choice for constant- 
envelope modulation schemes such as GMSK and is presented in detail. 

The contributions in this chapter provide a good overview of the current issues in 
RF front-end design, as the field of RF-IC design moves towards maturity, and the 
industry is poised to introduce the next generation of air-interface standards. 
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Abstract: The following questions are addressed in this chapter using a CMOS 
Colpitts oscillator as an example. 

1. How does the carrier amplitude depend on the electrical parameters of the oscil- 
lator? 

2. Does an additive interference, white noise for example, (always) become multi- 
plicative with the carrier in such a way that it deserves the name phase noise ? Is the 
response to such an interference always inversely proportional to its frequency offset 
from the carrier? Why? 

3. How does the phase-noise-to-carrier ratio depend on the electrical parameters of 
the oscillator? Can such a dependence be described directly with the circuit parameters 
rather than intermediate quantities such as the signal power consumed by the tank, the 
mean-squared voltage amplitude, or the noise factor of the oscillator? 

Answers to the above questions afford us a better understanding of LC oscillators 
and enable us to predict the exact oscillator performance without resorting to numerical 
simulation. 
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7.1 INTRODUCTION 

The importance of phase noise in oscillators in RF and other communication circuits 
has made it one of the most extensively studied subjects in electronics. Journal pa- 
pers on the subject [1-10] can be found from each of the last six decades, the earliest 
one [1] being as old as 1938, and the latest [10] as recent as 1998 The fact that papers 
on LC oscillators, too numerous in the last 60 years to cite exhaustively, keep appear- 
ing, seives to underline the fact that existing theories on phase noise have not been 
universally found satisfactory. 

Experimentally, the qualitative behaviour of phase noise has been well known. An 
oscillator’s output power spectrum consists of a peak at the carrier frequency (main 
oscillation, angular frequency coo), surrounded by a noise skirt symmetrical to the 
carrier frequency. Irrespective of the exact implementation, the noise skirt has the 
following characteristics: 

■ The noise spectral density is inversely proportional to the frequency offset from the 
carrier, except very close to the carrier frequency, where the influence of up-con- 
verted flicker noise dominates or the presence of the strong carrier begins to limit 
the measurement accuracy. 

■ The same noise manifests itself in the time domain as jitter around the oscillation’s 
zero-crossing points, which can only be caused by noise in the phase of the oscil- 
lation rather than that super-imposed on its amplitude. Oscillator noise is therefore 
usually referred to as phase noise. The assumption of phase noise implies that the 
sideband spectrum above and below the carrier frequency must be equal in ampli- 
tude and opposite in sign. It also implies that output noise is multiplied with rather 
than added to the carrier. 

■ It is widely believed [11] that doubling the quality factor of the LC tank roughly 
improves the oscillator’s canier-to-noise spectral density (C/N) ratio by 6dB, al- 
though systematic experimental verification of such a belief is hard to find in open 
literature. 

Today, RF oscillators are implemented with high-quality inductors and capacitors 
outside the chip containing the active devices. This is the main reason for the wide- 
spread efforts in the integrated circuit community to improve the quality of fully in- 
tegrated inductors on silicon, especially in the last few years [1 1-25]. Despite the 
avalanche of research papers, the quality factor Q of on-chip inductors at 1 GHz has 
hardly improved by a factor of two (from Q < 3 to Q < 6) in the last 10 years. Re- 
ported N/C ratios, however, vary by as much as 20 dB at 100 kHz offset. This under- 
lines the fact that other factors than the Q of the tank also affect the N/C ratio strongly. 
Our understanding of these factors seems at times confused. 

The confusion stems mainly from the inadequacy of existing phase noise models. In 
the next section, we will review typical existing models and analyze their weaknesses. 
This will be followed by our own analysis of a typical LC oscillator. The example used 
is the well-known Colpitts oscillator implemented with MOS transistors. The analysis 
starts in Section 7.3 with the general description of steady-state oscillation and the 
derivation of the exact oscillation amplitude as a function of circuit parameters. It will 
be followed by Section 7.4, describing the response of the oscillator to a small, deter- 
ministic interfering signal. Section 7.5 identifies the spectral density of all white noise 
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sources, leading to an analytical expression of oscillator output noise-spectral-den- 
sity-to-carrier ratio that is completely determined by circuit parameters. Experimental 
results will then be presented to validate the analytical expression, before the latter is 
used in the discussion of the exact design of RF oscillators. 

7.2 THE WEAKNESSES OF EXISTING PHASE NOISE MODELS 

Existing models in the last six decades can be grouped by the four typical methods 
used to derive them: 

■ Linear, frequency-domain analyses with noise sources additive to the carrier signal, 

■ linear, frequency-domain analyses with noise sources additive to the phase of the 
carrier, 

■ time-domain analyses with noise sources additive to the carrier signal, 

■ time-domain analyses with noise sources additive to the phase of the carrier. 

Since all the models set out to fit the well-known experimental observations de- 
scribed in the previous section, the resulting expressions do qualitatively predict many 
aspects of phase noise behaviour. It is in the validity of the reasoning that led to those 
expressions that we find serious weaknesses. 

7.2.1 Linear, frequency-domain analyses with noise sources additive 
to the carrier signal 

Linear analyses simply assume that noise is superimposed on the carrier and the trans- 
fer function from the noise source to the oscillator output is that of the resonator tank, 
modified by the equivalent linear transconductance of the active device. Fig. 7.1 shows 
the equivalent circuit of the simplest case [9]. The V-I characteristic of a real active 
device, on the other hand, can generally be depicted as in Fig. 7.2, the ~ shape serv- 
ing as a necessary feature to limit the oscillation amplitude [26]. The negative slope 
representing the negative (trans)conductance required for start-up is usually found in 
the middle. Since this V-I curve is usually highly nonlinear, the approaches to the 
equivalent linear transconductance ( g m ) have been different. 




Figure 7.1 Simple model used in typical linear analyses. 
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Figure 7.2 Typical description ot V-l characteristic of an active device and its average 
gm for a large signal. Practical transistors can be even more nonlinear. lg and V B are 
the bias current and voltage, respectively. 



Earlier analyses [2] allow the equivalent g m to be quite different from the critical 
transconductance ( g mc ) required to cancel the loss of the tank exactly. Such analyses 
result in models that generally predict a bandpass characteristic centered at the carrier 
frequency, flat at frequencies immediately adjacent to the carrier and rolling off at 6 dB 
per octave at higher offset frequencies. The comer between the two offset frequency 
ranges is typically of the order of the carrier frequency divided by the Q of the passive 
tank. One of the serious problems with such a model is that the predicted corner 
frequency, which can be 10 MHz for a 1 GHz oscillator and a Q of 100, is never 
observed in practice. 

To fit them to experimental results, later analyses [8,9] assume that the transcon- 
ductance (or gain) of the active device, seen by the noise signal, is exactly the same 
as that for the carrier, which in turn must be the same as that required to cancel the 
tank loss in steady state. The resulting equivalent circuit is an ideal, zero-loss linear 
resonator. The inverse dependency on offset frequency is now present all the way to 
the carrier. 

The fundamental objection to the use of the carrier’s g m is that the latter is only a 
crude large-signal concept indicating the ratio between the fundamental component of 
a distorted output current of a transistor and the voltage applied to its input. The noise 
component, being a smaller signal at a different frequency and phase, does not traverse 
the nonlinear V-I curve in Fig. 7.2 in the same way as the carrier, even though it is 
superimposed on the latter. The distortion it experiences, or the average g m applicable, 
will therefore also be different from that of the carrier. A specific example of the 
CMOS Colpitts oscillator will be shown later to underline this point. If one still insists 
on linear superposition of the carrier and noise, there is no more justification left than 
the need to fit the theory to the observed 1/Arn characteristic. Even so, consistency 
then requires that the small-signal g m at the center of Fig. 7.2 be used, because the 
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noise signal is a very small signal. However, the small-signal g m at the center is 
necessarily several times larger than the critical g m (gmc), otherwise oscillation would 
never have started! Further evidence that the type of linear analysis in [8,9] is wrong 
lies in the fact that it predicts a response 6 dB too high at low offset frequencies. This 
last point will be shown later in this paper. 

Fundamental objections can also be raised about the linear models for what they 
predict. If noise were merely added to the oscillation signal, it would not cause the 
zero-crossing-point jitter normally observed, even if the 1/A co characteristic is given. 
This is because the odd symmetry in phase with respect to the carrier frequency, inher- 
ent to modulation in phase, is absent in the predicted oscillator noise spectrum. 

7.2.2 Linear frequency domain analyses with noise sources additive 
to the phase of the carrier 

Papers in this class also use a linear model, but consider the noise sources to be added 
directly to the phase of the carrier [6,7]. The result is best known as Leeson’s Model or 
Leeson’s Formula. Although its original derivation is at best heuristic, it has become 
the most widely accepted model for phase noise in the last 30 years. In addition to 
being widely used in industry, it has been adopted in several reputable textbooks. The 
reasons for its success seem to be three-fold: 1. It is derived from phase so that the 
odd symmetry is assumed to have been taken care of. 2. The 1 / Am characteristic is 
there. 3. It predicts a perceived inverse dependence of the N/C ratio on the (loaded) Q 
of the tank. It is up to the users of the model to work out what the power level of the 
signal ( P s ) and the effective noise figure (F) of the oscillator mean. 

The fundamental problem with the model’s derivation lies with the noise sources. 
For white noise to be additive to the phase directly, it must be located at places where 
it can influence the frequency or phase-setting elements of the oscillator, such as the 
junction diodes (intentional or parasitic) in series or parallel with the main linear, pas- 
sive tank. Oscillators with negligible nonlinear LC elements would have no phase 
noise, or zero effective noise figure F, if we have to express the noise in terms of 
Leeson’s model, but we know this would contradict reality. Even for oscillators with 
nonlinear capacitors, it is quite unclear why the phase or frequency shift caused by 
white noise on the reverse bias of a junction diode would have to do with anything 
other than the C-V characteristic of the capacitor and its importance relative to other 
linear capacitors in the tank in determining the oscillation frequency. The introduc- 
tion of P s into the “input phase noise’’ seems totally arbitrary, even though doing so 
results in a model that seems to reflect its role in the final expression of the N/C ratio 
qualitatively. Numerical verifications with practical oscillators also show that noise 
levels associated with junction diodes are too low compared to the noise observed at 
the oscillator output, not least because good oscillators are designed to minimize the 
influence of parasitic diodes, and the bias circuits of frequency-setting varactors usu- 
ally have lowpass characteristics with cutoff frequencies well below the oscillation 
frequency. 
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7.2.3 Time-domain analyses with noise sources additive to the carrier 
signal 

Typical analyses [4,5] in this group model the nonlinearity of the active device with 
a third-order term and describe the dynamics of the oscillator with a Van-de-Pol dif- 
ferential equation driven by the noise source. Without pinpointing the precise flaws in 
those usually complicated derivations, it suffices to say that a constant term (yin [4]) 
is usually defined which is of the order of magnitude of the relative difference between 
the small signal g m of the active device and gmc- This term divided by Q defines the 
corner of the relative offset frequency (i.e. relative to the carrier) of the final output 
noise spectrum which has a bandpass shape. Again, the flat portion of the spectrum 
below such a comer frequency is never observed in practical oscillator measurements. 

7.2.4 Time domain analyses with noise sources additive to the phase 
of the carrier 

In the most recent theoretical journal paper [10] on this subject, a general theory is 
proposed which claims to be applicable not only to LC oscillators, but also to all other 
types such as ring oscillators. We restrict the focus of our discussion on LC oscillators. 
The main weakness in the reasoning in this paper lies in the assumption that interfer- 
ence is added to the phase. The example given to support such an assumption is an 
ideal tank, excited by a current impulse. The sinusoidal signal after a charge has been 
deposited on the capacitor by the impulse is phase-shifted and changed in amplitude 
relative to the sinusoidal signal before the impulse, were the latter to continue. It is 
correct that both the phase shift and the amplitude change are periodic functions of the 
time of the impulse, although the statement that at maximum phase shift the amplitude 
change is zero is wrong, because it is not possible to add energy (QAq/C + &q 2 /2C, 
if Q} /2C is the total energy before the injection of charge) to an ideal tank without 
changing the amplitude of the oscillation, no matter when the small charge A q is in- 
jected into the tank. The critical step of the paper’s argument comes when it is stated 
that some form of AGC action always exists in an oscillator so that the amplitude 
finally settles back to the same value independent of the impulse. For the ideal LC 
tank this statement is clearly wrong. For the lossy tank in a practical oscillator this 
may be true, but we need to highlight the word finally , for it usually takes hundreds 
of oscillation cycles for the impulse to settle (precisely because AGCs, if needed, are 
designed to be insensitive to small-signal, high-frequency perturbations). This is in 
stark contrast to the case of ring oscillators constructed by CMOS inverters, where the 
AGC function may consist of clipping at a reference voltage such as the power supply, 
which may be achieved within the same oscillation cycle. 

Since the impulse response of the amplitude does not decay to zero instantly, it 
contains as much information about the overall response of the LC tank to the injected 
current as the phase impulse response h,p(t) constructed in [10]. The same comment 
holds for the convolution between the impulse responses and the injected continu- 
ous-time current signal, which results in the amplitude and phase responses defined 
in [10]. The two responses are also strongly correlated. Throwing away the amplitude 
response, as it has been done in [10], destroys a part of the information needed to 
re -construct the overall response. The nonlinearity required to convert additive noise 
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to phase noise and generate the double-sideband spectrum is thus created more by the 
AGC assumption than any real transistors. To demonstrate further the fallacy of throw- 
ing away the amplitude response and using only the phase response to construct the 
output spectrum, we note that even for an ideal, passive LC tank with zero loss and 
resonance frequency coo, perturbed by an ideal sinusoidal current source at coo + A co, 
a response will appear at coq — A co according to [10]! General theorems about linear 
systems and the specific solution to the linear differential equation with a sinusoidal 
driving function tell us this is not possible. 

Even if an instantaneous AGC did exist, the model still has a serious problem. Since 
the derivation is independent of the offset frequency Aco, it should be valid even at 
offsets reasonably far outside the tank’s passband, where the tank impedance becomes 
very low. For a high- Q tank, the voltage produced on the tank by an interfering current 
is so small at high offsets that the AGC should not have any influence. The attenuation 
is also going to be so high that the active device will not havesufficient g m to produce 
enough current to counteract the injected current. In such a situation, one would expect 
the relationship between the interfering current and the voltage it produces on the tank 
to be the same as that of a passive tank without the active device. In this case, the model 
based on the linear analysis [9] must be valid, although the resulting interference can 
no longer be called phase noise due to the absence of the image sideband. The result 
of [10] in this case is wrong by a factor of two. 

Most models described above attempt to be general. In doing so they fail to include 
information on parameters vital to the use of the models. A typical example is the ef- 
fective noise figure F, which must be calculated taking into account all noise sources, 
their location in the circuit and the amount of distortion the noise experiences prior 
to reaching the tank. Another example is oscillation amplitude or power, which may 
depend on the Q of the tank. With the effective noise source and oscillation amplitude 
as intermediate variables, the exact relationships between N/C ratio and the circuit 
parameters such as tank loss, L and C values and transistor g m are obscured. This 
sometimes makes the models open to interpretation, which is not entirely satisfactory 
to the designer. 

In the following, we take the approach of analyzing a very specific oscillator, the 
Colpitts oscillator in Fig. 7.3a, in the time domain, so as to remove as much ambiguity 
about the assumptions as possible. Generalizations will be made only after the specific 
case is well established. Noise currents or voltages as they are introduced by the 
sources such as resistors and transistors are treated exactly as they are in the physical 
reality: additive to the carrier signal. 

7.3 GENERAL DESCRIPTION OF LC OSCILLATOR OPERATION AND 
DETERMINATION OF OSCILLATION AMPLITUDE 

Before one attempts the analysis of the influence of noise on oscillation, it is helpful to 
study first how a typical LC oscillator operates in steady-state. In the CMOS case, the 
study will also have the benefit of revealing an analytical expression for the oscillation 
amplitude [27]. The determination of the CMOS LC oscillator amplitude had always 
relied on some form of numerical solution. The most straight-forward method is to 
use a circuit simulator such as SPICE. An earlier analysis [28] shed some light on the 
way a MOS transistor operates in a Colpitts oscillator, but stopped short at parametric 
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Figure 7.3 (a) Minimum representation of a Colpitts oscillator, (b) Equivalent circuit 
with transistor represented by a linear transconductance, (c) Equivalent negative resis- 
tance model of oscillator. 



equations that require numerical solutions to obtain oscillation amplitude. The prob- 
lem with numerical solutions, however, is that insight into the way circuit parameters 
affect the final amplitude is partially or completely lost. 

Standard descriptions of LC oscillators are usually limited to the necessary condi- 
tion for oscillation. For small signals, the MOS transistor can be modeled as a linear 
g m , as shown in Fig. 7.3b. All the resistive losses of the tank have been lumped into 
a single resistor R s . The transistor forms a positive feedback loop with capacitors 
C i and C 2 . The equivalent impedance Zr looking to the right of the dashed line in 
Fig. 7.3b contains a negative real part: 

z ' = T = ^c + ^ S) = }^ct + 7zk'^fe’ <71) 

The necessary condition for the oscillation to start is that the total resistance in the 
equivalent LC tank in Fig. 7.3c be negative, 

8m > 8mc = (>JqC\C2Rs » (7.2) 

where gmc is defined as the critical transconductance for oscillation, and cdq = 1 /VLC 
is the tank’s angular resonance frequency. The negative middle term in the circuit’s 
characteristic equation results in the latter’s complex conjugate zeros having a positive 
real part. The zero-input response of the circuit is therefore an exponentially increas- 
ing sinusoidal oscillation. Equation (7.2) does not tell us whether the oscillation will 
stabilize to a particular amplitude. 

During start-up, the average current in transistor M\ may not be the same as the 
bias current. The difference between the two flows into the capacitive network formed 
by Ci and C 2 , causing the DC bias voltage between the gate and source of M\ , Vb, to 
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shift. This will in turn change the average current in the direction of the bias current. 
The steady state is reached when the two are balanced. Although the non-linearity of 
Mi’s V-I characteristic results in many harmonics in M\ ’s drain-source current, the 
latter is periodic in steady state. Fourier series expansion can therefore be used in the 
analysis. The steady-state oscillation amplitude between M\ ’s gate and source, V m , 
and the DC bias are bound by two constraints if the quality factor of the passive 
tank is high, so that the voltage on M\ ’s gate is practically a pure sine wave. 

1. The average current through Mi is equal to the bias current Iq. (/o corresponds to 
I B in Fig. 7.2.) 

2. The ratio between the amplitude of Mi ’s AC current and that of its gate-source 
voltage is exactly the transconductance of the passive tank between the branch 
and the I s branch in Fig. 7.3b at resonance, which happens to be g mc . 

These two constraints allow us to set up two equations to solve for the two un- 
knowns in steady-state oscillation, the bias voltage V B and the oscillation amplitude 

V m . 

Since the values of the passive components in the tank and their resistive loss, 
which (we will show) have a direct impact on oscillation amplitude, typically have 
tolerances of 5-20 %, an analysis with much greater accuracy is unnecessary for prac- 
tical purposes. This provides us with thejustification to describe Mi ’s I-V relationship 
with the simple square law, 

lDs(t) = ^[V gs (t)+V B -V T ] 2 , (7.3) 

where V gs (t) is the AC part of the gate-source voltage and Vj is the threshold voltage, 
below which the transistor cuts off and the current equals zero. The samejustification 
also allows us to assume that because the tank’s quality factor Q is usually high, 
harmonics in Mi ’s drain-source current Ids have been sufficiently filtered by the tank, 
so that V gs (t) is a pure sinewave. We shall experimentally verify the errors caused by 
such simplifying assumptions later. 

If the steady-state bias (V B — Vj) is greater than V m , as depicted in Fig. 7.4a, so 
that Vg S (t) — V m cos(cuo t) never causes the transistor to cut off, then the average current 
through Mi is given by: 

T 
T 

lave = /o = — ( ~[V m coscoot + V B - V T ] 2 dt 
71 J 2 
0 
T 

= [ [coscoot -x?dt = ^(\ + 2x 2 ), 

2n J 4 

o 



(7.4) 
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Figure 7.4 (a) Voltage and current waveforms when M\ is in amplifier mode, (b) 
Voltage of f(x) and its approximation. 



where x is defined as Vj v v& and x < — 1. The fundamental component of the AC 
current at coo, on the other hand, is given by: 

T 

2 T 

/(do = — f [coswof -x] 2 coscootdt = -xpV 2 . (7.5) 

7 r 2 J 

o 

Multiplying I m by the transimpedance between the I s branch and Vgs branch in 
Fig. 7.3b, we obtain the steady-state oscillation amplitude: 

V m = I wo H(cv 0 ) = ^^. (7.6) 

8mc 

Solving Eqs. (7.3) and (7.6) jointly, we obtain: 

V m = ^2ph-gl c mdx = -^-<-\. (7.7) 

Re-organizing (7.7), the same relations can be expressed as: 

P v rn = V2^2fiIo-gic < gmc , or g 2 mc < g 2 0 = 2 pio < ^g 2 c . (7.8) 

From (7.7), we see that if one wanted to keep Mi from cutting off, a combination of 
a small transconductance coefficient and a large bias current would be the best way to 
maintain a sufficiently large amplitude. The maximum V m is reached when the second 
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half of (7.7) is satisfied with equality, in which case V max = g m c/P- Condition (7.8) 
shows that the range for the nominal transconductance g m0 is very small, so that a 
20% increase in g mc can easily kill the oscillation. 

To ensure start-up, as well as adequate amplitude, practical oscillators are designed 
to have a g m0 several times larger than g mc . In this case the conditions in Eqs. (7.7) 
and (7.8) are no longer satisfied, and transistor M) does get cut off during part of the 
oscillation, as depicted by Fig. 7.4b. Eqs. (7.3) and (7.5) must therefore be modified 
to account for the time in which Mi is cut off. This is done by reducing the limit of 
integration to the time M\ is on: 

1 (v T -Vr\ 1 

r = — cos ( — — ) = — cos 1 (x) - 1 < x < 1 . (7.9) 

og) \ V m J wo 

The average current through Mi is now given by: 



2n 



COS * (Jt) 

j [cos(ftioO-*] 2 <if = 

0 



2n 



l+2x 2 



cos '(x) 






(7.10) 



The oscillation amplitude, on the other hand, is given by: 



km — Two H (u>o) — 



1 mfsv, 



gmc 



5 s / 



[cos(cuoO - x] 2 cos(cuo t)dt 






^(2 + x 2 )y/ 1 - x 2 — 3x cos 1 (jc)J . 



3jt R s coqC\ C2 
Re-arranging Eqs. (7.10) and (7.11), we obtain: 

In 4/a \(2 + x 2 )Vl -j: 2 -3xcos _, U)1 

= — /(*)=— - 7 != ; k 

gmc Jgmc I (J -)-2;c 2 )cOS *(jr) — 3xVl — X 2 I 

Io(5 + x) 

3R s ojqCiC2 



2^0 = /|mo\ = d 2 , v ^ 

\gmc ) 



S 2 

omc 



gHx) = 



[(1 + 2* 2 )cos 1 (jc)-3xv / 1 
^ ^(2 + x 2 )Vl — x 2 — 3jccos -1 (*)] 



(7.11) 



(7.12) 

(7.13) 



Even without solving Eq. (7.13) explicitly for x, it can be seen from Eq. (7.12) that 
the oscillation amplitude is only a weak function of x, and thus also of the duty cycle 
of Mi. As x varies within its limits ±1, the amplitude V m varies by no more than 
a third. One can thus say that V m is roughly proportional to the bias current /o, and 
inversely proportional to the critical g m for oscillation, g mc . Fig. 7.5 compares the 
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exact function f{x) and its approximation (5 + x)/3 in Eq. (7.12). The maximum 
difference is less than 2 %, occurring at x = 0, which is a negligible error in terms 
of amplitude prediction. The approximate version of Eq. (7.12) has the advantage of 
much greater simplicity. 




Figure 7.5 Comparison of f(x) with its approximation. 



Although a solution for* in explicit form is more difficult to obtain from Eq. (7.13), 
it is only a function of M\ ’s nominal transconductance g m0 normalized to the g mc of 
the passive tank. Thus * needs to be solved only once numerically versus a useful 
range of normalized g m o and can be looked up once the ratio between g m o and g mc is 
known. Fig. 7.6 shows the calculated x, as well as the corresponding duty cycle a = 
2 t/T —cos, ~ l (x)/n versus {g m o/Smc)- Note that the horizontal axis is on logarithmic 
scale to allow the low g m o/g mc region to be better displayed. It can be seen that both x 
and the duty cycle are relatively steep functions when the normalized g m o of Mi is low. 
For a gmo/gmc ratio of 3, which can easily be expected for practical designs, x = 0.2 
and a = 44%. The improvements of x and a are much slower beyond g m o/g mc > 5, 
resulting only in 0.5 < x < 1 and a < 33% so that setting M\ ’s transconductance 
much higher than the g mc of the tank is inefficient as far as achieving better amplitude 
is concerned. The limit in V m is achieved when gmo/8mc tends to infinity, in which 
case x tends to unity and 




Smc 



It is interesting to note that this limit is independent of the detailed parameters of 
M\ . In fact, the limit is exactly the same if Mi in Fig. 7.3a is replaced by a bipolar 
transistor [26]. It is also worth noting that the duty cycle of Mi is 0% in the limit 
and only 33 % for g m o/g mc = 5. This indicates that the active device, be it MOS 
or BJT, operates deeply in class-C mode in a typical LC oscillator. In other words, 
the active device behaves more like a switch than an amplifier. To demonstrate this, 
we replace the active transistor in the Colpitts oscillator by a switch in series with a 
current source / pu ise, as shown in Fig. 7.7a. The threshold of the switch is set relative 
to the s-node of the circuit. If / pu ise is larger than 7 q, the DC voltage across capacitor 
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Figure 7.6 Parameter x and duty cycle as a function of normalized nominal g m of 
Mi. 



C i will change relative to the threshold of the switch so as to restrict the period the 
switch is turned on. The constraint is that the average current through the switch equals 
the bias current, / pu i se -2z/T — /q. If we increase the available current / pulse further 
and further, the duty cycle of the switch becomes smaller and smaller, as depicted 
by Fig. 7.7b, until the current through the switch becomes a Dirac function of area 
IoT . Thus, irrespective of how the switch is constructed, as long as a large current is 
available during its on-period, the limit of the oscillation amplitude is: 

T 

v m = ( — )^f loTSiOe-j^'dt = — . (7.15) 

\ Smc ) 7 J gmc 

0 




Figure 7.7 Thought experiment: Colpitts oscillator with ideal switch (a). Current vs. 
duty cycle (b). 



The examples ofBJT and MOS implementations show that the details of the switch 
construction determine how fast the limit in Eq. (7.15) is reached. As long as the active 
device, be it BJT, MOS or any other switch-like transistors, has sufficient transconduc- 
tance, the details are unimportant, as the amplitude will be close to the limit set by 
Eq. (7.15). In practical designs, insight into the approximate amplitude dependence 
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on the circuit parameters can be obtained from Eq. (7.15), whereas a more precise 
estimate can be obtained from Eq. (7.12). 

To verify Eq. (7.12), SPICE simulations as well as experiments have been per- 
formed. The simulated amplitude agrees with Eq. (7.12) to within 1 % for Q > 50. 
For a moderate Q of around 10, the simulated amplitude is about 1-2 dB below that 
predicted by Eq. (7.12). To perform controlled experiments, a 78 MHz Colpitts os- 
cillator is constructed as shown in Fig. 7.8, where the values of the main oscillator, 
as well as those of biasing and coupling components, can be found. The choice of 
the oscillation frequency was made to be sufficiently low to allow values of the res- 
onator inductor and capacitors to be much higher than board-level parasitics, and yet 
sufficiently high so that the conclusions from the measurements can be extrapolated to 
high-frequency oscillators in the low gigahertz range. Other reasons against choosing 
a much lower frequency include suitability to high-frequency instruments that prefer 
circuits with low impedance levels in general, and oscillation frequency drift during 
the required measurement time for the same relative accuracy. Avoiding interferences 
caused by strong broadcast stations was also a consideration in the choice of the oscil- 
lator frequency. 




Bci 



Rea 



Figure 7.8 Colpitts oscillator constructed for measuring amplitude and duty cycle, as 
well as the responses to injected sinusoidal current. 



The transistor M i has a transconductance coefficient of 90 pA/V 2 and was pro- 
duced in a 1 pm CMOS process. Before mounting the transistor, the equivalent par- 
allel resistance R p of the tank at resonance has been measured, and the value of the 
parallel resistance, R c \, is selected accordingly to make the resulting R p exactly 500 ft 
(or g mc = 2mS) at resonance. The gate bias voltage is adjusted by means of R a i to 
ensure that the measured DC current through M\ is 0.5 mA during steady-state oscilla- 
tion. The corresponding g m o = 6.7mS. According to Fig. 7.6, x = 0.275 and the duty 
cycle a = 0.41, whereas the oscillation amplitude is 0.44V according to Eq. (7.12). 
Fig. 7.9 shows the measured steady-state source and drain voltages of Mi. The oscil- 
lation amplitude at the source is 450 mV, whereas the duty cycle, measured as the time 
between die two cursor positions of the drain voltage waveform divided by the oscil- 
lation period, is 0.414. Further measurements of a similar oscillator for four different 
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gmc’s versus bias current from 200 p.A to 5 mA [27] show that the amplitude predic- 
tion by Eq. (7.12) is always within 1 dB of measured values. Since normal tolerances 
of both the resonance and the loss of LC tanks, whether made of SMD components or 
integrated on silicon, can easily be 5-20 %, the tolerance in gmc can easily be a cou- 
ple of dBs. The accuracy of Eq. (7.12) is therefore as good as necessary for practical 
design putposes. 



TeK Run: 25 OGS/s ET Sample PrTrig 




Figure 7.9 Measured oscillation waveforms. Channel 1: Mi's drain voltage repre- 
senting its drain current. Channel 3: Mi's source voltage. 



In the above oscillation amplitude derivation, the fact that signals are all periodic 
in the oscillator is exploited. An alternative way to derive the oscillation amplitude, 
which will also prove useful for analyzing small interfering signals at frequencies 
different from the main oscillation, is to represent the Colpitts oscillator in steady 
state by a conceptually equivalent circuit, as shown in Fig. 7.10. To avoid the difficulty 
associated with the piece- wise representation of Mi ’s V-I characteristic, we represent 
the cut-off period of Mi with the switch shown in Fig. 7.10, driven by a gating function 
having the same frequency as the main oscillation, but with an unknown duty cycle 
a = 2z/T = cos“'(x)/rr. The current fed into the passive tank, I s (t), can now be 
represented as: 



Is(t)=^[V m COS(D0t + V B -VT} 2 - 



2z v-^2sin(icuoT) , 

— + 2_ J t* 7 cos(icuot) I . (7.16) 



i=i 



in 



The first term is the usual square-law V-I relationship for MOSFETs, except that we 
assume it is valid even if the sum inside the first brackets (the effective gate overdrive) 
is negative. Since the second term in Eq. (7.16), which represents the gating function, 
becomes zero whenever the effective gate overdrive is negative, Eq. (7.16) is the same 
as the usual piece- wise representation of the MOS transistor’s V-I relationship. This 
is illustrated in Fig. 7.11. 
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Figure 7.10 Equivalent representation of the Colpitts oscillator in Fig. 7.3a. 




Figure 7.11 Voltage and current waveforms showing that the piece-wise nature of 
Mi’s current can be represented by Eq. (7.2). 



With Eq. (7.16), the DC and fundamental frequency components of the feedback 
current can be worked out by pure trigonometrical manipulations. It can be shown that 
the amplitude and duty cycle worked out this way are identical to those obtained by 
the earlier Fourier series expansion method. The advantage ofEq. (7.16), however, is 
that signals that are additive to the main oscillation but are not necessarily periodic in 
the latter’s period can be analyzed, as long as their amplitudes are sufficiently small 
compared to the main oscillation, so that the frequency and duty cycle of the gating 
function remain practically the same. 
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7.4 OSCILLATOR RESPONSE TO AN INTERFERING CURRENT 

Having established the oscillation amplitude as a function of the tank and transistor 
parameters as well as the bias current, and re-emphasized the fact that the active device 
in most high-g LC oscillators operates in class-C mode, we are now better placed to 
tackle the phase noise in LC oscillators. Earlier on we objected to the linear analysis of 
phase noise, which assumes that an interfering signal (sinusoidal current for example) 
will generate a response (voltage) on the tank, and that the transistor acts like a linear 
transconductance equal to the critical g m of the tank so that the equivalent tank loss 
is zero. To demonstrate that this is wrong with a concrete example, we return to the 
Colpitts oscillator and assume that an interfering voltage, V u cos[(wo -f 8co)t + <p u ], is 
present at the gate: 

6 

h (0 = -(E m cos(a)o0+ E u cos[(<uo + Aw)/ + <£„] + V B 

2 t ^2sin(iwor) 

• : -cos(icnor) . 

T f— f in 



-V T ) 2 

(7.17) 



Expanding Eq. (7.16) and collecting current terms only at wo + Aw, we find: 

7j[(ft>o + Aw)/] = | ~~ ^(2 -|- _r 2 )\/ 1 jc 2 3x cos -1 (ic)j + ^(1 -x 2 )i 

8V 

■ — — V u cos[(«o + Aw)/ +<p u ] ■ (7.18) 

Combining (7.18) with (7.11), we obtain: 



/j[(wo-|- Aw)/] 



gmc + 



PVrr, 

3n 



(1-A 2 )5 



V u cos[(wo + Aw)/ + </>„]. 



(7.19) 



Equation (7.19) clearly shows that, except in the very special case of Ixl = 1, the 
equivalent transconductance of M\ to the interfering voltage V u is not exactly the 
critical g m required to cancel the tank loss. In fact, for small values of x, the second 
term in the square brackets in (7.19) is usually much larger than the first term. 

Equation Eq. (7. 16) also shows the more fundamental flaw of the linear analysis. If 
an interfering voltage at wo + Aw indeed appears, the resulting current will not only 
have a component at the same frequency, but also one at its image frequency, wo — Aw, 
due to the gating function. This second component will be fed back to the tank and 
generate a voltage component at the image frequency, so that a double-sideband re- 
sponse will result even if the interference is only single sideband. The voltage at the 
image frequency will in turn be modulated by the gating function, to create its own im- 
age, that is at the original interfering frequency wo + Aw. It is the interaction between 
the component at wo 4- Aw, which we refer to as the response, and the component 
at w - Aw, which we refer to as the image, that holds the key to all the phase noise 
behaviour which we have been trying to understand. 

To analyze this double-sideband response to a single-sideband interference, let us 
inject a small current interference (shown in grey in Fig. 7.10), /„ cos [(wo + &a))t + 
</>„], into the s-node in Fig. 7.10. The main oscillation (carrier) serves as the reference. 
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so its phase is assumed to be zero. Although the phase of the injected current is not 
important, we assume it is </>„ for the sake of generality. The process in which the 
response and image are modulated back and forth is aided by the catalyst of the gating 
function, so that it is reasonable to assume that the steady state is reached within a 
small number of oscillation cycles. The imprecise word small is to be understood as 
much smaller than the number of clock cycles it takes for the response to an impulsive 
interference such as individual bursts caused by white noise to decay significantly 
on a high-Q (low loss) tank, so that steady-state analysis is justified for bursty noise 
sources. At this stage we only know that there will be a voltage response at both 
coo + Aco and coq — Aco, but their amplitude and phase in steady state, V u , <f> u , and 
Vl, 4>l f°r the upper and lower sidebands, respectively, are unknown. 

V gs (t)=Vi(t)~V B 

= V m cos(cuot) + V u cos[(tuo + Aio)t +0 H ] + V L cosl(too - Aco)t + 4>l\ ■ 

( 7 . 20 ) 



Assuming that V u and Vl are much smaller than V m and referring to Fig. 7.10, the 
feedback current l s in steady state is given by 

8 

Is (0 = — { v m cos(tuot)+ V u cos[(ft >0 + Aw)/ -I- <t> u ] 

, r 2: 

+ F/,cos[(&) 0 - Aw)t + <pL] + Vb- VtF- — 

<a = Y< 100%, -► \x\ < 1 j . 

Expanding Eq. (7.21), we find that only the temis at the carrier frequency coo, inter- 
ference frequency coq + Aco and the image frequency &»o — Aco are worth collecting, 
because they will be fed back to the gate of the tank to sustain the oscillation and in- 
terfering voltages. Those terms containing second-order combinations of V u and Vl 
can be neglected because they are much smaller than the rest. The terms around DC 
and higher harmonics of cuocan also be neglected, because they will be heavily attenu- 
ated by the high - Q resonator and will not produce much voltage at the gate. Thus the 
significant part of I s , /' is given by: 

/,(») = ^ [(2 + J/ 2 )\/ 1 - jc 2 - 3a: cos“ 1 (jc) j 

• { V m cosbwoO + V u cos[(<wo + Aco)t +(/> u ] + V L cos[(«o - A co)t + </>/,]} 

+ -r-^(l - x 2 ) I { Vu cos [(&>0 + A co)t +<t>u] + Vl cos[(«o - A co)t + <Pl\ 

3n 

+ Vl cos[(&>o + A «)/ ~4>l]+ V u cos[(cuq - Aco)t -0 U ]) . (7.22) 



+ A2rin(|«or) 

U in J 

( 7 . 21 ) 



Applying Kirchhoff’ s current law to the s-node in Fig. 7.10, we have: 



dV i dVo 

Is(t) + I n (t) + C l-jf~C 2 ^=0 



( 7 . 23 ) 
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It can be shown that for A co <£ coo, 



„ dV i „ dV 2 

Cl ^r _C 2 7 r^ 

2(Ci + C 2 )Aco( V„sin[(<uo + Aco)/ + </>„] - Vl sin[(coo — Aco)/ + </>/.]} 



- R s C x Cvol{V m cos(coo/) + V u cos[(coo + A co)/ + <p u ] + Vz,cos[(coo - Aco)t+<p L ]} . 

(7.24) 



Combining Eqs. (7.11), (7.22) and (7.24) into Eq. (7.23), we obtain: 

8 V m 'j 3 

— — (1 - x 2 ) 5 { V], cos[(coo + A co)t+<J> u ] + VtcosKcao - Aco)/ + 4>l] 

+Vi cos [(coo + Aco)/ -<Pl\ + Vu cos[(coo - A co)t -<p u ]} 

+ 2(Ci + C 2 )Aco{V u sin[(a>o + Aco)/ +<f> u ] - Vi,sin[(coo - Aco)/ + </>/,]) 

+ 1„ cos[(coo + A u>)t + 4>n] = A ■ cos[(coo + Aco)/] + B ■ sin[(coo + Aco)/] 

+ C • cos[(cwo — Aco)/] + D ■ sin[(a>o - Aco)/] = 0 . (7.25) 



For Eq. (7.25) to be valid at all times, the coefficients for the sine and cosine terms at 
both frequencies must be zero. This provides us with four simultaneous equations to 
solve for the four unknowns in steady state, V u ,<p u , Vl, and 4>L- 

A =^(1 -x 2 )kv L cos(<l> L )+ E„cos(^)] + 2(C 1 + C 2 )AcoE H sin(^ M ) 

3/r 

4~ In cos <p n = 0 , (7.26) 

B=^( l-x 2 )kv L sin (4> l ) ~ Vu sin (<M1 + 2 (C, + C 2 ) Aco V u cos (<p u ) 

3 n 

— I n &m<t> n = 0 , (7.27) 

C =~{\ - x 2 )* [Vl cos (<p L ) + V u cos (</>„)] - 2(Ci + C 2 )AwV L sin (</> L ) = 0 , 

37T 

(7.28) 

D =~(\ -x 2 )3 [V u sin(<j> u ) - V L sin(0£>] - 2(Ci + C 2 )AcoV L cos(^t) = 0 . 

3 n 

(7.29) 



Solving Eqs. (7.26H7.29) jointly, we find: 



4>l 

4>u 

V L 



tan 1 



pV m (l-x 2 )l 
3n{C\ + C 2 )Aco 



-<t>„+kn 



k = 0,1,2... 



(7.30) 



n —4 >l~ 4 >c where tj) c = tan 1 
V u cos <j> c 



6;r(Ci + C 2 )Aco 
pV m (l-x 2 )i . 



[2(Ci +C 2 )A<o] 2 [1 + 3cos 2 (<Pc)] 

I_l [P Vm] 2 d ~ x 2 ) 3 + [6tt(Ci + C 2 ) Aco] 2 

[2(Ci + C 2 ) Aco] 2 (4 • [p V m ] 2 (l - x 2 ) 2 + [6jt(Ci + C 2 ) Aco] 2 } 



(7.31) 

(7.32) 



.(7.33) 
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Usually the offset frequency is relatively small, 6n(Cl + C2)Aa> $V m (l — jc 2 )5 , 
so that the upper- and lower-sideband responses are equal in amplitude and 180° out 
of phase, as shown by Eqs. (7.31) and (7.33). This is just the same as what would be 
created by a modulation in phase, so that additive interference does create modulation 
in phase, due to the nonlinear and switching behaviour of M j . For the same low offset 
frequencies, 



’n 



4[2(C, + C 2 )Aw] 2 



or 



V„ 



In 

4 (Ci + C 2 )Aw 



(7.34) 



Note that V u in Eq. (7.34) is half of what would be predicted by a linear analysis. 
The upper-sideband response is indeed inversely proportional to the offset frequency, 
as has been well known experimentally. This inverse dependency does, however, not 
continue uninterruptedly, according to Eq. (7.33). At sufficiently high offset frequen- 
cies outside the passive tank’s passband, 67r(C 1 + C2)Aa> )£> 20 V m (\ — x 2 )? , and the 
transistor no longer has sufficient g m to provide any significant feedback to counter 
the interfering current, so that the relationship between the latter and the voltage re- 
sponse is purely that of a passive tank, and the image sideband disappears, according 
to Eqs. (7.32) and (7.33). The result is now identical to that predicted by a linear 
analysis: 



F u 



2 • (C) + C 2 )A&> 



(7.35) 



The transition point where the image sideband response Vi becomes 3 dB smaller than 
the upper sideband response V u can also be derived from (7.32): 



Aw - 3dB = /^(l-* 2 )* 

6^(Ci+C 2 ) 



(7.36) 



For oscillators with 100% current duty cycle, x < — 1, the comer frequency is zero 
and the noise remains additive, because there is no more gating function and the squar- 
ing nonlinearity of the MOSFET only translates the upper-sideband interfering signal 
to frequencies around DC and the second harmonic of the carrier, which will be sup- 
pressed by the carrier. When x tends to 1, as in the case of a lossless resonator, the 
comer frequency also tends to zero and the output noise remains additive. For most 
practical oscillators |jc| < 1 and noise becomes multiplicative, affecting the phase of 
oscillation. The -3 dB offset frequency given in Eq.(7.36) is usually too high to be 
observed in normal phase noise measurements, because the sideband spectral density 
at such a frequency is already well below the noise floor of the measuring instruments. 

It is important to note from Eqs. (7.34)— (7.36) that the detailed transistor parame- 
ters are only important in determining the boundary between the interferences that will 
merely be super-imposed on the main oscillation and those that will cause the phase 
of the oscillation to jitter. In the former case, the 1 /Aw characteristic is merely due to 
the impedance of the passive tank, decreasing outside its passband. In the latter case, 
it is the creation of the image response and the effective negative feedback loop be- 
tween the signals at the two frequencies that are responsible for canceling any excess 
g m of Mi that is not needed for compensating the loss of the passive tank. Although in 
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both cases the response is inversely proportional to the offset frequency, the two cases 
differ by 6 dB and cause very different disturbances to the oscillator. In the phase jitter 
case, both the image response and its interaction with the response at the frequency 
of interference are due to the active device’s nonlinearity, especially the switching 
action in most oscillators. Analyses attempting to bypass the explicit description of 
the transistor’s nonlinearity (a slowly varying AGC is a linear device in this context) 
therefore bypass the most fundamental mechanism of amplitude-to-phase-noise con- 
version. This tends to result in shaky foundations even though the resulting models 
of such analyses are good guesses for what a real oscillator’s noise spectrum would 
look like. Our analysis shows that the modulation process is essential, yet the exact 
amount of M\ ’s g m , and indeed the detailed shape of its large-signal V-I characteris- 
tic, are unimportant. Equations similar to (7.26)— (7.29) can be set up for transistors 
with quite different characteristics. While the comer frequency will now be different 
from that given by Eq. (7.36), Eq. (7.34) will still result at low offset frequencies. In 
this sense our analysis is general and is applicable to all high-0 LC oscillators. 

To verify the results in this section, the oscillator in Fig. 7.8 is used to test its voltage 
response to an injected deterministic current. Since the equivalent input impedance at 
the s-node can be very high close to the resonance frequency of the tank, injecting a 
well-known current is difficult. Instead of applying a current directly, we apply a sinu- 
soidal voltage as shown, and measure the voltages V s and V, . The Norton equivalent of 
the circuit within the dashed box in Fig. 7.8 is shown in Fig. 7.12. Since the effective 
injected current is given by V/ //?*i , the impedance Z looking into the s-node of the 
oscillator from the equivalent current source is the measured node voltage Vi divided 
by Vi/Rb\,Z = (V s /Vj)R b \. Since for low-loss tanks the current flowing through C\ 
and C 2 close to resonance is much higher than other current components flowing into 
the s-node, the latter can be neglected and V gs (t) = (Ci/C2)V f (0- The transimped- 
ance Z f = Vg S /ln can therefore be measured reliably, at least in the close vicinity of 
the carrier, via the voltage ratio (V s / V, ). Close to the carrier, the applied signal lev- 
els are kept very small to avoid pulling the main oscillation. As the offset frequency 
is gradually increased, the voltage response gradually becomes so small that it sinks 
under the noise floor of the spectrum analyzer. The interfering signal level is then 
increased whenever necessary just to keep the spectral peaks representing the voltage 
response above the noise floor by a few dBs. Also measured is the ratio between the 
voltage response at the image frequency and the injected current, which we refer to as 
the image transimpedance. 




Figure 7.12 The Norton equivalent representation of the subcircuit devised to inject 
controlled current into the oscillator. 
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The measured transimpedances are shown in Fig. 7.13, against those calculated us- 
ing Eqs. (7.32) and (7.33). Also shown is the straight line to which the upper-sideband 
transimpedance is asymptotic at higher offset frequencies, which would result from a 
linear analysis. The measurements and calculations agree to within 1 dB at low offset 
frequencies and 2 dB at high offset frequencies, which is remarkable over nearly 6 
decades of offset frequency and 7 decades of impedance levels. Even more remark- 
able is the accurate prediction of the cross-over point at 1.3 MHz. This shows that as 
long as the parasitics are small and the oscillator can be truly represented by Fig. 7.3a, 
— the formulas derived above are very accurate. 




Figure 7.13 Calculated and measured voltage responses at both the upper sideband 
and lower sideband, to injected current at the upper sideband (the circuit parameters 
are slightly different from those in Fig. 7.9: V m =0.37V and *=0.2.). 



The significance of the theoretical prediction of the cross-over point and its accurate 
experimental verification lies not so much in the 6dB difference in noise prediction 
at higher frequencies as in being a litmus test of the rigorousness of the formulation 
of our noise analysis. The cross-over point is usually so far away in relative offset 
frequency from the carrier that the noise there and beyond is so low that it does not 
affect any practical design. The fact that nobody ever noticed its existence before is 
ample proof of this. However, the two observations embodied in the conclusions of our 
analysis and experiments are so fundamental and common sense that analyses leading 
to opposite conclusions to any of the two must be based on false assumptions. The two 
observations are that interfering noise close to the carrier causes phase modulation 
that results in a double-sideband response, and that interference far away from the 
oscillation frequency is no more than an additive noise. In the latter case, the response 
must be very similar to that of a passive tank alone. 

A further point to note from Fig. 7.13 is that the l/Acu characteristic continues all 
the way down to 100 Hz offset, where further measurement was prevented by the drift 
of the free -running oscillator. An early measurement of a Clapp oscillator at a similar 
carrier frequency, locked to a crystal oscillator by a very-narrow-band PLL to reduce 
drift, showed that such a 1/Atu characteristic continues nearly to 10 Hz. The experi- 
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ments therefore demonstrate that analyses predicting a much higher comer frequency 
are wrong. Below such offsets, pulling must be dominant so that the exact shape of 
the phase noise should be immaterial. At the high offset end, our measurements show 
that the 1/Aw characteristic continues until 64 MHz, further than 2/3 of the carrier 
frequency itself! No flattening is observed, so that part ofLeeson’s description of the 
phase noise spectrum is incorrect. Our experimental results are to be expected, though, 
for it would be very hard to imagine what noise mechanism would create a response of 
constant amplitude on a passive tank, while the latter’s impedance gradually declines 
to zero! The flat noise spectrum we sometimes do see in practical oscillators must 
have come from buffers rather than the LC oscillator proper. 



7.5 NOISE-TO-CARRIER RATIO IN A CMOS COLPITTS OSCILLATOR 



The analysis of the oscillator response to a single sinusoidal current can easily be 
extended to multiple signals at different frequencies. If the sum of all interfering 
signals is small compared to the carrier, then one can assume that the interfering cur- 
rent at each frequency will generate a pair of response voltages at the gate, and the 
expression of M \ ’s feedback current is similar to Eq. (7.21), except now there are 
many pairs of response voltages at the gate that are unknown to us. As long as we 
can neglect the second-order combinations of interfering voltage components after ex- 
panding the square-law function, the subsequent mathematical manipulations can be 
considered linear, because the multiplication by the gating function only results in 
frequency translations of each component. Thus each pair of voltage responses can 
be solved individually, and will be related to the individual current component in the 
same way as described by Eqs. (7.32) and (7.33). In this sense, the oscillator response 
to composite current of different frequencies is the superposition of the responses to 
current components at each frequency. This enables our noise analysis to follow all 
the reasoning of noise analyses of common linear networks. 

If the offset frequency is much higher than Aw _3dB (Eq. (7.36)), the noise spectral 
density of the oscillator’s gate-source voltage at each frequency is simply given by 
Eq. (7.35) times the spectral density of equivalent noise current at that frequency. In 
the much more important case that the offset frequency is below Aw _3dB , the response 
is given by Eq. (7.34) times the spectral density of equivalent noise current at the same 
frequency plus the contribution due to the image of noise current component at the 
other side of the carrier. 



V^(m + Aw) 



I%(coq + Aw) rf(wo - Aw) 

16 • (Ci + C 2 ) 2 Aw 2 + 16 • (Ci + C 2 ) 2 Aw 2 
, / n (wo + A w)/*(w - Aw) 
8 (Ci+C 2 ) 2 Ao> 2 



In (7.36), C' represents the cross correlation factor between the two noise components. 
For white noise it is reasonable to assume that noise components at different frequen- 
cies are uncorrelated to each other, i.e. C' = 0, so that the noise voltage spectral density 
is given by: 



V 2 (<wo + Aw) = 



8-(Ci + C 2 ) 2 Ao> 2 ’ 



(7.38) 
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Experiments will be compared to Eq. (7.38) later to verify the assumption C' = 0. 
Now that we have established the relationship between the phase noise spectrum and 
the input white noise current, what remains to be done is to identify all major noise 
sources in the Colpitts oscillator. In the order of increasing importance, they are the 
noise current associated with the bias transistor M b , the noise current due to the loss 
resistance R s of the tank, and the noise current due to the main switching transistor 
Mi • If we can convert all three sources into equivalent current sources as shown in 
Fig. 7.14, then the total output noise can be easily obtained by applying Eq. (7.38). 




2 



n 



InB 



Figure 7.14 Colpitts oscillator with all noise sources represented as current into the 
s-node. 



Since the drain of Mb is directly connected to the s-node in Fig. 7.3a and its current 
is constant, the noise current due to Mb is the most straightforward, l„ B = ^~8M ti - 
The noise contribution of R s can be worked out via the Norton equivalent current at 
the s-node. Representing the noise of R s as a voltage source and short-circuiting the 
s-node, we can obtain the mean-squared current noise power spectral density (PSD) 
of the Norton equivalent circuit near the resonant frequency, % (c oCz ) 2 -4 kTR s = 

0 C 2 /Ci)-4kT-g mc . 

The time-varying nature of its drain-source current makes M j a more complex 
noise source. Referring to Fig. 7.7b, the average drain-source current through Mi 
during the time it is on is given by Eq. (7.39). 



/' 

'ml 



ll 



E =;o (n = , ” 

a \2r ) cos U*) 

8 kT , SkT I n 8 kT I n 

3 3 y cos 1 00 3 V cos *(*) 



(7.39) 

(7.40) 



The average white noise power spectral density during Mi ’s on-time is therefore given 
by Eq. (7.40). If a time-domain noise current « '(f) has the same power spectral density 
as that given by Eq. (7.40), then the time-domain noise current for Mi is equivalent to 
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n'(t ) gated by the switching function s(t) with duty cycle a = cos l (x)/n. 



n{t) = n'(t) 



It 2sin(iwor) 

■^r + 2^ — (coskuqO 



i=l 



in 



(7.41) 



Since n\t ) itself has a bandwidth much wider than the frequency of the gating func- 
tion, the power spectral density of n(t ) around the oscillation frequency is approxi- 
mately that of n'(t), given by Eq. (7.40), scaled by the duty cycle of the gating func- 
tion [29]. 



2 cos ’(jc) 2 %kT I cos - 'Or) 
K = = —gmOyj — - - 



(7.42) 



A more accurate estimate of / 2 is possible by splitting M\ ’s on-time into infinitely 
small intervals, each dr wide, to allow the exact current to be used for each interval 
instead of the average in Eq. (7.39). Since the noise contribution of each dr interval 
can be calculated with a gating duty cycle of dr /T, the overall noise PSD is the sum 
of the individual contributions, which amounts to an integration that leads to 



,2 V 1 - x 2 - x cos ^.r) 

h = -nr-P V m 

J 7T 



(7.43) 



Since most practical oscillators already have fairly low duty cycles, the difference be- 
tween Eqs. (7.42) and (7.43) is fairly small. For the circuit in Fig. 7.8, the overall noise 
estimate using Eq. (7.43) is about 0.5 dB below that using Eq. (7.42). Equation (7.42) 
is therefore more preferable because it gives more insight into the relationship between 
noise and the nominal transconductance, as well as the current duty cycle. Both are 
likely to be design parameters. 

Replacing the noise current term in Eq. (7.38) with the sum of contributions from 
the bias transistor, the resistive loss of the tank, and that of the switching transistor 
Ml, we obtain: 



V„(a)Q + Aw) = V n 2 (w - Aw) 
kT 

~ 2-(Ci+C 2 ) 2 Aw 2 



2 

jgmB + 



(C 2 \ , 2 jcos~ l (x) 

\C~J 8mc+ 3 8m0 y n 



(7.44) 



The ratio between the noise spectral density and the signal power can now be derived 
by combining Eqs. (7.12) and (7.44): 




(7.45) 
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For small duty cycles, x is close to unity, and Eq. (7.45) is then approximately given 

by: 



N 

c (Acu) = 



kT / mq d 
/2g 2 \2A co) 



2 

^ SmB + 




2 

8mc 4 " “ -8m0 



cos _ 1 (jc) 

n 



(7.46) 



To verify Eq. (7.45), the parameters in Fig. 7.8 were used to calculate the oscillator 
output spectral density due to thermal noise sources. The noise contributions in the 
square brackets in Eq. (7.45) are 0. 17 mS for the 6 k£2 bias resistor, 2 mS for the rest of 
the resistive loss of the passive tank that had been measured before M\ was mounted, 
and 2.86 mS due to M\. The quality factor Q of the tank is slightly under 25. This 
gives a calculated phase noise of -1 15 dBc/Hz at 10 kHz offset. The calculated phase 
noise is compared with measured results in Fig. 7.15, covering 4 octaves of offset 
frequency range. 
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Figure 7.15 Calculated and measured phase noise for the oscillator in Fig. 7.8. 



Because the oscillator was designed to be relatively high-Q (to be close to the prac- 
tical situation that has been assumed in the derivations of this paper), the overall phase 
noise is very low. The noise floor of the instrument and the onset of up-converted 
flicker noise therefore limit the measurable thermal noise region to the last two oc- 
taves. This gives us three measurement points that are clearly on the 6 dB/oct line. To 
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Figure 7.16 Measured output power spectrum for the oscillator in Fig. 7.8. 



remove the influence of the noise from the instrument (HP 8563 E Spectrum Analyzer), 
the latter is calibrated by measuring an 80 MHz crystal oscillator at the corresponding 
offset frequencies. Since the phase noise of an 80 MHz crystal oscillator is better than 
-1 10 dB at 300 Hz offset [30], the measured noise at offsets above 1 kHz is effectively 
that of the instrument. This uncorrelated noise contribution of the instrument is then 
subtracted from the measured noise power at the 10 kHz and 20 kHz offset points. As 
is typical of most noise measurements, the repeatability of the measured noise power 
spectral density (with averaging) is about within 0.5-1 dB. The measurement accu- 
racy of the points shown in Fig. 7.15 can therefore be considered to be 1 dB. This is 
the same accuracy with which the calculated thermal noise line in Fig. 7.15 predicts 
the measurement. The measurement at 1.25 kHz is 16 dB higher than that at 5 kHz, 
showing this region to be strongly influenced by flicker noise. From the slope of the 
measured noise PSD we can infer that the cross-over point between thermal noise 
and flicker noise is around 1.5 kHz. Adding the inferred flicker noise to the calcu- 
lated white noise contribution, we have a calculated phase noise that also agrees with 
measurement within 1 dB for the first two octaves of measured offset frequency. This 
can be seen as conclusive evidence that Eq. (7.45) accurately predicts the sideband 
noise-to-carrier ratio, not least because there are no parameters used in the calculation 
of Eq. (7.45) that have not been independently verified by measurement to the desired 
accuracy first. Since the relationship between an injected current and measured volt- 
age response has also been accurately verified in the previous section, the significance 
of the measurement in Fig. 7.15 lies in verifying Eqs. (7.36) and (7.38), as well as the 
power spectral densities of all the white noise sources in the oscillator, especially that 
of Equation (7.38) is 3 dB below what would be predicted by a linear analysis 
similar to Eq. 17 in [9]. Fig. 7.16 shows a measured spectrum of the oscillator. 
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7.6 EXACT DESIGN OF RF OSCILLATORS 

In addition to having been derived on a rigorous basis, Eq. (7.45) is completely based 
on circuit parameters known to the designers. This makes the discussion of the exact 
design of RF oscillators most straightforward. Before this discussion, however, it is 
interesting to compare Eq. (7.45) or Eq. (7.46) to the well-known Leeson’s formula, 
which designers have been using for the last 30 years, to examine the dependence on 
the resonator’s quality factor Q in particular. In Leeson’s original paper, the signal 
level P s at the input of the active device is not very clearly defined, so that it can 
either be interpreted as the square of the oscillation voltage, or the power dissipated 
by the resonator. In the former case we can see that for a constant oscillation voltage, 
the noise-to-carrier ratio will only scale with the noise power given in Eq. (7.44), 
which clearly does not scale with Q 2 . In the latter case, P s = (C l / C2)g mc (V m ) 2 /2. 
Equation (7.46) can thus be rewritten as 




2-L„ 2 (ruo + Aut) /CA 
(Cx/C 2 )gmcV 2 \C 2 ) 8mC 

2kT / ojq \ 2 / Ci \ 2 g m B ( Cl \ 2 2 SmO jcos^'(x) 
(Q)2p s \2Aw) \C 2 ) 3gmc \Ci) 3 g mc y 7i 



2 kT F / coo \ 2 
(Q) 2 P S V2W ’ 



( 7 . 47 ) 



which apparently is Leeson’s formula with an inverse dependence on Q 2 \ Closer ex- 
amination of Eq. (7.47), however, shows that both the signal power Ps and the noise 
figure F so defined, far from being independent parameters, are usually also strong 
functions of Q. No conclusions could therefore really be drawn about the Q depen- 
dence of the oscillator noise performance on the basis of Leeson’s formulation. 

In practical situations, one does sometimes find that doubling the Q of the tank 
improves the N/C by exactly 6 dB, while the transistor and its bias current are un- 
changed. This can arise when the change of Q is purely due to improvement in the 
loss resistance R s , and the middle term in the square brackets of Eq. (7.46) is much 
smaller than the other two terms. This may have been the case especially for designers 
using discrete MOSFETs or BJT transistors, where the transconductance for a given 
current is not within the designer’s control. For an integrated circuit designer this need 
not be so, because the size of the transistor should be scaled down with g mc to main- 
tain just the necessary g m o/gmc range of 3-5 as explained earlier. In this case, the 
noise factor F in Leeson’s formulation is unchanged, while the signal power P s has 
doubled. The improvement of N/C will be 9 dB each time Q is doubled by halving 

R s 

On the other hand, N/C can be significantly changed by different combinations of 
tank parameters with the same quality factor Q. Assuming, as a typical example, that 
the loss resistance R s of the inductor dominates the loss of the tank and scales linearly 
with the inductance L s . Doubling the inductance and halving the capacitances then 
leaves both the Q of the tank and its resonance frequency unchanged. The combined 
change in resistance and capacitance, however, causes g mc in Eq. (7.46) to be halved, 
as well as the required g m o, so that 3 dB can be gained either in achievable N/C or in 
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the required current consumption without improving the quality of the inductor! In- 
deed, we note that most recently published oscillators with integrated inductors have 
very small capacitors, which gave them respectable phase noise performance at a rea- 
sonable current consumption even though the integrated inductors continue to have a 
poor quality factor between 3 and 6 at 1 GHz. One’s ability to improve phase noise by 
reducing capacitance, however, is limited by the parasitic capacitors associated with 
the circuit, which have higher losses and which tend to increase the required tuning 
range for the desired frequency. A higher tuning range in capacitance may also result 
in higher variability of oscillation amplitude, which in turn affects N/C. 

An additional advantage of the phase noise model described by Eq. (7.45) is that 
the role of current consumption (as opposed to power consumption Pqc, or the signal 
power P s ) is explicitly stated. Indeed, there is an almost one-to-one relationship be- 
tween N/C and the bias current Iq. This point is important in RF oscillator designs for 
mobile communications, where power consumption is extremely important. 

For a local oscillator (FO) used in radio frequency circuits, especially those used 
in cellular telephones, the three most important parameters are oscillation amplitude, 
phase-noise-to-carrier ratio and power consumption. For a given voltage supply, the 
latter reduces to current consumption. A well-defined FO amplitude is required by the 
mixers, which tend to exhibit a higher noise figure and less conversion gain both when 
the FO amplitude is too low and too high. Fimiting amplifiers (buffers) are frequently 
used to remove variations of the FO amplitude due to the imprecise nature of the 
tank loss as well as the tuning of capacitors, but a well-designed worst-case amplitude 
range still results in the best power consumption. The phase noise of the oscillator 
relative to the carrier amplitude is responsible for reciprocal mixing of out-of-band, 
strong interfering signals to the same IF frequency as the desired RF signal. It also 
reduces the signal-to-noise ratio in the desired radio channel. Power consumption, on 
the other hand, directly affects the lifetime of the battery. The latter is usually the most 
expensive item in a cellular telephone [31,32]. Bearing the above performance para- 
meters and Eqs. (7.12), (7.45) and Fig. 7.6 in mind, we can formulate the following 
systematic design procedure for RF oscillators: 

1. Determine the lowest capacitance that can be used based on the knowledge of cir- 
cuit parasitics, variability of the capacitors themselves, required RF tuning range 
(for channel selection, for example) and the available variable capacitor tuning 
range. (Note that this may be substantially larger for a production circuit than 
that for a laboratory experiment.) 

2. Select or implement (in the case of fully integrated circuits) the inductor in such 
a way that ensures the best worst-case Q for the resonator. The value of the in- 
ductance is determined by capacitances chosen in the previous step and the desired 
nominal frequency. 

3. Set the value of the bias current to achieve the desired amplitude according to 
Eq. (7.12) and using the worst-case g mc . (If the difference between best- and worst- 
case amplitudes exceeds what is acceptable by the mixer, then either a limiting 
amplifier must be used, or step 1 must be repeated to reduce the difference.) 

4. Use Eq. (7.12) and the worst-case g mc to calculate the ratio N/C at critical offset 
frequencies, assuming that g m o will be 3-5 times g mc . If it exceeds the prescribed 
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limit by k dB, and a limiting amplifier is used to limit the LO amplitude, then bias 
current should be increased by 0.5k dB, to reduce N/C by k dB. 

5. If no limiting amplifier is used, the bias current should then be increased by k dB in 
step 4, which reduces N/C by 2k dB. The design procedure must then be repeated 
from step 1 to ensure that the best-case amplitude is not too high. This is done by 
increasing the capacitances by 0.5k dB. so that gmc is increased by k dB to bring 
amplitude down to the desired level. This increases the N/C by another k dB so 
that the acceptable limit is just met. 

6. Size Mi according to the bias current, so that its nominal transconductance g m o is 
3-5 times g mc - 

7. The transconductance g m g of the bias transistor Ms should be made as small as 
possible to reduce noise. Since the bias current is fixed, this means reducing its 
W/L ratio. The latter is limited by the available voltage headroom for a given 
supply to keep the transistor in the saturation region of operation. 

With the accurate formulas derived in this paper, the Colpitts oscillator design be- 
comes a routine for the designer as of step 3 above. The designer can then concentrate 
on the less well-defined steps 1 and 2, estimate of parasitics and variability as well as 
implementation of better inductors. 

7.7 CONCLUSIONS 

Despite numerous journal and conference papers published in the past 60 years, our 
understanding of why the phase -noise-to-carrier ratio is inversely proportional to the 
offset frequency from the carrier has been limited. This paper seeks to argue why the 
reasoning behind most of the phase-noise models has been based on shaky founda- 
tions. As a result, the models’ predictions of noise are wrong in one portion of the 
spectrum or the other for LC oscillators. 

In this contribution, the oscillator is described in the time domain without any a pri- 
ori linearity assumption. Noise sources are introduced where they are first added to 
the signal, to avoid assuming a conclusion (such as noise in phase) before the start 
of any analysis. The only assumption based more on intuition than rigour was the 
assumption that for bursty type of sinusoidal interferences, modulation by switching 
helps the responses reach steady state in much less than the natural decay time of 
the passive tank. Experiments seem to validate this assumption. All basic equations 
set up in this paper are based on the Kirchhoff laws. The approximations that have 
been made include assuming high-Q resonators that will attenuate harmonics of the 
oscillation voltage on the resonator to negligible levels, a piece-wise square-law V-I 
characteristic for the MOS transistor, and that the interfering signals are sufficiently 
small compared to the main oscillation such that the duty cycle in which the transistor 
turns on remains virtually unchanged. 

The active device in a typical LC oscillator operates heavily in class-C mode, turn- 
ing on for only a short fraction of each oscillation period to inject a current pulse into 
the LC tank to maintain the steady state. In the limit, the oscillation amplitude of any 
branch voltage within the tank is two times the average current through the branch of 
the switching active device divided by the transconductance between the two branches 
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at resonance. This limit is independent of the exact V-I characteristic of the active de- 
vice. Practical high- Q LC oscillators, such as those implemented with BJT or CMOS 
transistors, have oscillation amplitudes very close to this limit. 

The highly nonlinear V-I characteristic of the active device that causes the latter 
to be switched by the main oscillation modulates any interfering signal to generate a 
response not only at the frequency of the interfere^ but also at its mirror frequency 
with respect to the carrier. This image response is in turn modulated by the oscillation 
through the transistor, which creates its own image at the frequency of the original 
interfering signal. This forms an additional negative feedback loop that cancels any 
excess positive feedback of the transistor. The response at the interfering frequencies 
close to the carrier thus becomes inversely proportional to its frequency offset from the 
latter, irrespective of the exact V-I characteristic and the duty cycle of the switching 
device, nor the passband width of the passive tank. Such a response and its image are 
practically equal in amplitude and opposite in polarity. Their combined effect is there- 
fore equivalent to a modulation of the carrier in phase. At higher offset frequencies, 
the tank impedance becomes very small so that very little voltage is produced at the 
transistor input. The transistor’s feedback current, which contains the image response 
due to modulation, is insignificant compared to the interfering current. The voltage 
response to the interference is therefore now virtually the same as that of a passive 
tank, which still decreases with the inverse of the offset frequency, with negligible 
image component. The effect of noise — far away from the carrier is therefore merely 
additive. The cross-over point between dominance of phase noise and dominance of 
additive noise is a function of the transistor and tank parameters. 

Because of the existence of image response, phase noise due to white noise is 3 dB 
higher than that due to noise components at frequencies above (or below) the carrier 
alone. The switching nature of the transistor also makes the noise generated by the 
latter depend on the duty cycle, and hence on the parameters of the rest of the oscillator. 
All noise sources, including that of the switching transistor, have been calculated in 
this chapter. The resulting expression for the phase-noise -to-carrier ratio comprises 
therefore only original circuit parameters instead of intermediate variables that require 
determination by simulation or experiments. 

Controlled experiments have been carefully constructed to verify the theory de- 
rived in the paper. The predicted oscillation amplitude is consistently within 1 dB of 
measurements. The predicted duty cycle also matches measured values by the same 
accuracy. The response of the oscillator to an injected interfering current as well as the 
image response agree with what the theory predicts within the accuracy with which 
parameters of the passive tank can be measured. In the 80 MHz range, this accuracy 
is better than 1 dB. The cross-over frequency between phase and additive noise, and 
indeed its very existence, have been confirmed by experiments, in practical designs 
Measurements of the phase-noise-to-carrier ratio also show that the agreement with 
the formula derived in this paper is within 1 dB, the repeatability of any phase noise 
measurement and the accuracy within which circuit parameters have been indepen- 
dently measured. The phase noise model in this paper can therefore be used with 
confidence in practical designs. 

Although the specific Colpitts oscillator has been used in deriving the formulas in 
this paper, the basic reasoning behind the setting up of basic equations is not specific 
to Colpitts oscillators. Although for other LC oscillators, the determination of the 
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amplitude and duty cycle as well as the critical g m will be different, the basic ideas 
about why the oscillator’s response to interference is phase noise for low and additive 
noise for high offset frequencies and the role of the image in the inverse dependence 
of phase noise on offset frequency will remain the same. In this sense, the analysis in 
this paper is general for all LC oscillators. 
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Abstract: This chapter describes the design of a CMOS transceiver targeting dual- 
band applications in the 900 MHz and 1.8 GHz bands. The receiver is based on a mod- 
ified Weaver architecture, and the transmitter incorporates new methods of quadrature 
up-conversion. The receiver and the transmitter have been fabricated in a 0.6pm CMOS 
technology. 

8.1 INTRODUCTION 

The emergence of various wireless standards within the US and around the world 
has created a demand for RF transceivers that can operate in more than one mode. 
The standards deployed in the 900 MHz band are the TDMA standard IS-54 and the 
CDMA standard IS-95 (US) and the GSM standard (Europe). Additionally, with the 
availability of the 1.8 GHz band for cellular networks, new standards such as PCS 1900 
and DCS 1800 have been introduced. Furthermore, the Global Positioning System 
(GPS) (1.5 GHz) and the European cord-less standard DECT (1.9 GHz) provide attrac- 
tive services that can be incorporated in mobile telephones. In addition to providing 
“roaming” capability, the use of two or more standards that operate in different bands 
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boosts the capacity of the network, thus lowering the probability of unsuccessful calls 
during peak traffic hours. Thus, multi-standard RF transceivers are predicted to play a 
critical role in wireless communications in the 900-2000 MHz range. 

This paper describes circuit and architecture design techniques for dual-standard 
RF transceivers with emphasis on CMOS implementations. Particular attention is paid 
to issues such as cost, level of integration, form factor, power dissipation, and time to 
market. These techniques are presented in the context of a dual-band system specifi- 
cally designed to operate with GSM and DCS 1800 standards. Both the receiver and 
the transmitter have been fabricated in a 0.6pjnCMOS technology. The experimental 
results obtained for each circuit are described in [1-3]. 

8.2 RECEIVER DESIGN CONSIDERATIONS 

Accommodating two or more standards in one receiver generally requires substantial 
added complexity in both the RF section and the baseband section, leading to a high 
cost. Thus, the system must be designed so as to share as much hardware as pos- 
sible. For this reason, both the architecture design and the frequency planning of a 
multi-standard transceiver demand careful studies and numerous iterations. 

The GSM and DCS 1800 standards incorporate the same modulation format, chan- 
nel spacing, and antenna duplexing. Summarized in Table 8.1 are the characteristics 
of each standard, indicating that a dual-band transceiver can exploit the properties 
common to both so as to reduce the off-chip hardware. 



Table 8.1 System characteristics of GSM and DCS1 800. 





GSM 


DCS 1800 


Modulation 


Gaussian Minimum Shift Keying 


Multiple Access 


Time-Division Multiple Access 


Duplexing 


Frequency-Division Duplexing 


Receive Band 


935-960 MHz 


1805- 1880 MHz 


Transmit Band 


890-9 15 MHz 


1710-1785 MHz 


Channel Spacing 


200 kHz 


200 kHz 


Number of Channels 


124 


350 



In order to minimize the number of oscillators and synthesizers, the receivers and 
transmitters in a dual-band system must be designed concurrently, with the frequency 
planning carried out such that the receive and transmit paths are driven by the same 
synthesizers. Although GSM and DCS 1800 use frequency-division duplexing (FDD) 
at the front end, their actual operation is somewhat similar to time-division duplex- 
ing (TDD) because their receive and transmit time slots are set off by 1.73 ms (three 
time slots) [4]. Thus the receiver and the transmitter can time-share the frequency 
synthesizers. 
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Figure 8.1 Simple dual-standard receiver. 




Figure 8.2 Dual-band receiver. 



Fig. 8.1 illustrates an example of a dual-standard receiver with two different input 
frequency bands. After down-conversion to a single intermediate frequency (IF), the 
two signals are added and digitized. The remaining IF and baseband processing is 
subsequently performed in the digital domain. Note that the dynamic range and noise 
of the A/D converter are critical issues in such an environment. 

The receiver of Fig. 8.1 requires many off-chip filters. Alternatively, the receiver 
can be configured as shown in Fig. 8.2 [1], where the 900 MHz and 1.8 GHz bands 
are processed by two paths containing a low-noise amplifier (LNA) and a quadrature 
down-conversion mixing operation. The results are subsequently added and, after sim- 
ple on-chip bandpass filtering, are applied to a second set of quadrature mixers. Note 
that each quadrature pair of the first set of mixers constitutes a Weaver image-reject 
receiver together with the second set [5]. Also note that the first local oscillator (LO) 
frequency is set midway between 900 MHz and 1.8 GHz, i.e., the first mixing opera- 
tion is with high-side injection for 900 MHz and low-side injection for 1.8 GHz. This 
choice of frequencies makes the two bands images of each other, allowing the Weaver 
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architecture to select one and reject the other by addition or subtraction of the outputs 
of the second set of mixers. 

Owing to gain and phase mismatches, the image rejection ratio of the Weaver ar- 
chitecture is limited to 30 to 40 dB, inadequate for cellular applications such as GSM 
and DCS 1800. Nevertheless, since for each band the image is 900 MHz away, the 
front-end duplexer filters provide more than 40 dB of rejection. 

The distribution of gain, noise, and nonlinearity in the receiver chain plays a key 
role in the overall performance, necessitating iterations between architecture design 
and circuit design. Since channel-selection filtering is postponed to the stages fol- 
lowing the IF mixers, the third-order intercept point (IP 3 ) of each stage must scale 
according to the total gain preceding that stage. With the initial estimate of the IP 3 , 
the corresponding circuit is designed so as to minimize its noise contribution. 

The problem of secondary images [4] constrains the choice of the second IF in the 
Weaver topology. We consider two scenarios here. The present design provides quad- 
rature down-conversions to allow translation to the baseband. This approach, however, 
suffers from some of the difficulties encountered in direct-conversion receivers. For 
example, DC offsets due to the self-mixing of the second LO, flicker noise in the ana- 
log baseband circuits, and I and Q mismatch corrupt the down-converted signals [6,7]. 



-33 -33 




Figure 8.3 GSM receiver interference mask. 



Another possibility for the second down-conversion exists by virtue of the specific 
channelization in GSM and DCS 1800. As shown in the receiver interference mask 
of Fig. 8.3, the adjacent channels in a cell are unoccupied. This suggests that, if the 
second LO frequency is placed at the edge of the desired channel, then the secondary 
image coincides with one of the adjacent channels. Illustrated in Fig. 8.4, this tech- 
nique translates the center of the desired channel to 100 kHz instead of zero. Thus, 
flicker noise corrupts primarily the edge of the channel, and elimination of DC offsets 
by AC coupling becomes feasible. Furthermore, the down-conversion in the analog 
domain need not separate the signal into I and Q phases, thereby avoiding related 
mismatches. 

The above approach does introduce a difficulty: the channel-select filter now re- 
quires much sharper selectivity because 43 dBm interferers appear closer to the de- 
sired channel. This issue may be resolved by partitioning the filtering between analog 
and digital domains, albeit at the cost of tightening the linearity requirements of the 
A/D converter. 
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Figure 8.4 Near-baseband down-conversion. 




Figure 8.5 LNA/mixer combination. 



8.3 RECEIVER BUILDING BLOCKS 
8.3.1 LNA/mixer 

In order to achieve a relatively low noise figure and a reasonable input match, the 
LNA employs a common-source cascode stage with inductive degeneration (Fig. 8.5). 
To avoid uncertainties due to bond-wire inductance, both the source inductor and the 
drain inductor are integrated on the chip. The LNA draws approximately 5 mA from 
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the supply and exhibits a noise figure of less than 2.5 dB and an IP3 of greater than 
-2 dBm in each band. The parasitic capacitance of L 2 , the drainjunction and overlap 
capacitance of M 2 , and the input capacitance of the mixers resonate with L 2 at the 
frequency of interest. With a Q of about 3, this resonance lowers the image signal by 
approximately 10 dB. 

The LNA directly drives the quadrature RF mixers, which are configured as single- 
balanced circuits. Employing inductive loads to minimize thermal noise, each mixer 
drains 2 mA to achieve a reasonable trade-off between noise and nonlinearity. 

With 22 dB of voltage gain in the LNA, it is desirable to realize an IP3 of greater 
than + 15 dBm (l.SSVrrosfin the mixer, while maintaining its input-referred noise 
voltage below roughly 5nV/\/V.The dimensions of Ms-Mioin Fig. 8.5 has a strong 
impact on the performance of the RF mixer. Transistor M% is sized such that its over- 
drive voltage is sufficiently large to guarantee the required IP3. This is in contrast to 
bipolar implementations, where enormous emitter degeneration would be necessary to 
achieve an IP3 greater than +15 dBm. The key point here is that for a given bias cur- 
rent and IP3, a properly sized MOS transistor exhibits a much higher transconductance 
than a degenerated bipolar structure. Transistors A^and MioinFig. 8.5 also influence 
the noise and conversion gain of the mixer. The choice of the width of these devices 
is governed by a trade-off between their switching time and the parasitic capacitances 
they introduce at node P. For a given (sinusoidal) LO swing, M9 and A/10 are simulta- 
neously on for a shorter period of time as their width increases. A compromise is thus 
reached by choosing (W/L^io = 400 pm x 0.6pm, allowing the pair to turn off with 
a differential swing of 100 mV while degrading the conversion gain by less than 1 dB. 

The interface between the LNA and the mixer merits particular attention. As shown 
in Fig. 8.5, to achieve a well-defined bias current in the mixer, the LNA incorporates 
the DC load M3 with diode-connected devices M4 and M5. Neglecting the DC drop 
across L 2 , we note that Vqs 4 + VcS5 = Vgs3 + Vg 58- Thus, proper ratioing of M3and 
Mg with respect to M4 and Ms defines Id 8 as a multiple of I 2 . Capacitor C\ provides 
AC ground at the source of M3 so that the output resistance of M3[= 1 /(g m 3 + gm63)l 
does not degrade the Q of L 2 . Realized as an NMOS transistor, Ci consists of a large 
number of gate fingers to reduce the channel resistance, achieving a Q of greater than 
30 at the frequency of interest. 

In contrast to AC coupling techniques, the above approach incurs no signal loss, 
but it consumes some voltage headroom. Interestingly, M3 can serve as the current 
source for another circuit, e.g. an oscillator, thus re-using the bias current of the LNA. 

8.3.2 IFmixer 

The differential output of the RF mixers in Fig. 8.5 is capacitively coupled to the input 
port of the IF mixers, allowing independent biasing. With an overall voltage gain of 
about 26 dB in the LNA and the RL mixers, the nonlinearity of the IL mixers tends to 
limit the performance of the receiver. To this end, we note that a differential pair with 
a constant tail current exhibits higher third-order nonlinearity than a grounded-source 
pair using the same current and device dimensions. Shown in Fig. 8.6, the IF mixer 
is configured as a double-balanced circuit consisting of an input pair, a current multi- 
plexer, and a switching quad. The input devices draw a drain current of 1 mA and are 
sized to sustain an over-drive voltage of 500 mV, thereby achieving an IP3of approx- 
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imately +18 dBm (3.2 V rms ). The low transconductance of these transistors together 
with voltage headroom limitations ultimately results in a slight voltage conversion 
loss (about -2 dB) in the IF mixer. The switches in the current multiplexer negate the 
signal current according to the logical state of band while sustaining a drain-source 
voltage of approximately 35 mV. 




An important concern in heterodyne and image-reject receivers is the translation 
of various interferers to the desired channel frequency after down-conversion. Owing 
to nonlinearities and switching operations in each mixer, an interferer at cu, nt results 
in components at ka) i nt ± ma> L0 . With two down-conversions using (Olo, and o>lo 2 » 
the down-converted spurs appear at kco ml ± mco L oi ± nco LQ2 (Fig. 8.7), many of which 
may fall into the desired baseband channel. Since in-band interferers are not filtered 
before channel selection, and since they are located on the same side of a> LOI as the 
desired signal, they are not suppressed by the image-rejection technique used in the 
receiver. It is also important to note that the spurious response is not exercised in 
a simple noise figure measurement, but it reveals the performance of the receiver in 
a realistic environment. The spurious response of the dual-band receiver has been 
examined with the aid of a spreadsheet program. Five interference frequencies in each 
band were found to be the most significant sources of down-converted spurs. 

Another interesting phenomenon that results from the choice of the two LO fre- 
quencies is the in-band leakage to the antenna. The LOi-IFi feed-through and the 
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Figure 8.7 Generation of spurs. 



low Q of the BPF give rise to a significant 1350 MHz component at the input of the 
IF mixer. Upon second mixing, this component is translated to both 900 MHz and 
1800 MHz, potentially appearing as in-band leakage to the antenna(s). However, by 
virtue of differential signaling from the first IF onward, and by proper low-pass fil- 
tering at the output, this type of leakage can be suppressed to acceptably low values. 



8.4 TRANSMITTER DESIGN CONSIDERATIONS 

In order to minimize the number of oscillators and synthesizers, the receivers and 
transmitters in a dual-band system must be designed concurrently, with the frequency 
planning made such that the receive and transmit paths are driven by the same syn- 
thesizers. Although GSM and DCS 1800 use frequency-division duplexing (FDD) at 
the front end, their actual operation is somewhat similar to time-division duplexing 
(TDD) because their receive and transmit time slots are set off by 1.73 ms (three time 
slots) (Fig. 8.8). Thus, frequency synthesizers can be time-shared between the re- 
ceiver and the transmitter. Since the dual-band receiver incorporates LO frequencies 
of 1350 MHz and 450 MHz, it is desirable to utilize the same frequencies for the trans- 
mit path as well. 



TX 



J 










RX 











1.73 ms 



Figure 8.8 Time offset between receive and transmit time slots. 



Before considering suitable transmitter architectures, we review Gaussian mini- 
mum shift keying (GMSK) modulation to arrive at some of the design implications. 
Fig. 8.9 conceptually illustrates frequency shift keying (FSK) and GMSK. In FSK. 
rectangular baseband pulses are directly applied to a frequency modulator, e.g. a volt- 
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(a) 



BB 





(b) 

Figure 8.9 Generation of FSK and GMSK signals. 



age-controlled oscillator (VCO), thereby creating an output waveform given by 

*fsk(0 = Acos[<w c / + Kq j *bb(0^] > (8-1) 

where Kq is a constant denoting the "depth” of modulation and xbb(0 represents the 
baseband signal. 

An important drawback of FSK is the large bandwidth occupied by the modulated 
signal, partly because of the abrupt transitions in the frequency introduced by the sharp 
edges of the baseband pulses. We expect that if the frequency changes more smoothly 
from one bit to the next, then the required bandwidth decreases. In fact, the spectrum 
of the signal expressed by Eq. 8.1 decays in proportion to / 2(n+3) . where n denotes 
the highest continuous derivative of jcbbCO [9]. Based on this observation, GMSK 
modulation alters the shape of the baseband pulses so as to vary the frequency grad- 
ually. As shown in Fig. 8.9(b), the rectangular pulses are first applied to a Gaussian 
filter, thereby generating smooth edges at the input of the frequency modulator. The 
resulting output is expressed as 

xgmsk(0 = Acos[a; c r + Ko J xbb {t)*h(t)dt], (8.2) 

where h{t) = exp(— t 2 /r 2 ) is the impulse response of the Gaussian filter. 

The conceptual method described by Fig. 8.9(b) and Eq. 8.2 is indeed employed in 
some transmitters, e.g. for the Digital European Cordless Telephone (DECT) standard. 
However, if the amplitude of the baseband signal applied to the VCO or the gain of 
the VCO are poorly controlled, so is the bandwidth of the modulated signal. For this 
reason, in high-precision systems such as GSM, the waveform in Eq. 8.2 is rewritten 
as 



xgmsk(0 = Acos(o c tcosd - A sina> c f sin#, (8.3) 

where Q = Kof xbb(0 * h(t)dt, and cos# and sin# are generated using accurate 
mixed-signal techniques [4,10]. Equation 8.3 forms the basis for our transmitter de- 
sign. 
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In order to employ 450 MHz and 1350 MHz LO frequencies, we postulate that the 
transmitter must incorporate two up-conversion steps: from baseband to an intermedi- 
ate frequency (IF) of 450 MHz and from 450 MHz to 900 MHz or 18 GHz. We also 
recognize that a simple mixer driven by the 450 MHz IF and the 1350 MHz LO gener- 
ates the 900 MHz and 1.8 GHz signals with equal amplitudes, necessitating substantial 
filtering to suppress the unwanted component. It is therefore desirable to perform the 
second up-conversion by single-sideband (SSB) mixing. 

With the foregoing observations, we consider the topology shown in Fig. 8.10 as 
a possible solution. The baseband I and Q signals are converted up to 450 MHz and 
subsequently separated into quadrature phases by means of an RC-CR network, result- 
ingin Vm = AcosCaqr —6 — n/A) and Vs = <4sin(oqt — 9 — n/A). Single-sideband 
mixing of the IF and the second LO signals is then carried out by two mixers, with 
their outputs added or subtracted so as to produce the 900 MHz or 1800 MHz output 
according to the band select command. 




Figure 8.10 Simple dual-band transmitter. 



The architecture of Fig. 8.10 provides a compact solution for dual-band operation, 
but it suffers from several drawbacks. First, the RC-CR network introduces a loss of 
3 dB in the signal path and, more importantly, loads the first up-converter. Second, 
both of the outputs appear at the same port, making it difficult to utilize tuned narrow- 
band amplification at this port. Third, even with perfect matching between the quadra- 
ture paths in the SSB mixer, the variation of the absolute value of RC with process and 
temperature leads to a considerable amplitude mismatch between Vm and Vs .thereby 
creating a significant unwanted sideband at the output. For example, a 20 % error in 
RC results in an unwanted sideband only 20 dB below the wanted component. 

The existence of an unwanted sideband 900 MHz away from the desired signal 
may seem unimportant, because various filtering operations in the following power 
amplifier (PA) and matching network provide further suppression. However, sec- 
ond-order distortion in the PA — a significant effect, because PAs are typically sin- 
gle-ended — may lead to a troublesome phenomenon in the generation of DCS 1800 
signals. Illustrated in Fig. 8.11, the issue arises because the second harmonic of the 
900 MHz sideband falls into the transmitted DCS1800 channel. Since, from Carson’s 
rule [8], the second harmonic of a frequency-modulated signal occupies roughly twice 
as much bandwidth as the first harmonic, the 1800 MHz output may exhibit substantial 




900 MHz/1.8 GHz CMOS TRANSCEIVER FOR DUAL-BAND APPLICATIONS 



119 




Figure 8.1 1 Effect of unwanted sideband on DCS1 800 output. 




Figure 8.12 Increase in adjacent-channel power due to second harmonic of un- 
wanted sideband. 



adjacent-channel power (ACP), violating the transmission mask (Fig. 8.12). Thus, the 
unwanted sideband produced by the SSB mix er must be sufficiently small. 

In summary, the architecture of Fig. 8.10 requires two modifications: (1) the IF 
quadrature generation must avoid the use of RC-CR networks, and (2) the GSM and 
DCS 1800 paths must be separated at some point such that each can incorporate nar- 
rowband tuning. 

In order to produce the quadrature phases of the 450 MHz IF signal, we recognize 
that the baseband signal is available in quadrature phases, namely, cos# and sin# 
in Eq. 8.3. The IF signal can thus be generated in quadrature form as depicted in 
Fig. 8.13, where proper choice of the phases together with addition or subtraction 
at the output yields both cos(&qf + 0) and sin(&qf + #). Compared to the circuit of 
Fig. 8.10, this configuration both provides higher gain balance between the two paths 
and avoids the loss and loading of the RC-CR network while using two more mixers. 
The additional mixers consume more power, but since the conversion gain is higher 
in this case, the following SSB mixers require less power, leading to an overall power 
dissipation comparable to that of Fig. 8.10. 

The 450 MHz outputs of the first up-converter can now be multiplied by the quad- 
rature phases of the second LO and added or subtracted to generate the GSM and 
DCS 1800 signals. As mentioned above, it is preferable to design the second up-con- 
version modulators such that the 900 MHz and 1.8 GHz waveforms appear at the out- 
puts of two different circuits, thus allowing efficient narrowband amplification. This is 
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Figure 8.13 Up-conversion with quadrature outputs. 




Figure 8.14 Two-step up-conversion generating 900 MHz and 1 .8 GHz outputs. 



accomplished as shown in Fig. 8.14, where two independent SSB modulators produce 
the two bands according to the band select command. Note that narrowband tuning 
also suppresses the unwanted sideband resulting from mismatches in the SSB mixers. 
To save power consumption, only one of the modulators is active in either mode. 

The overall architecture of the dual-band transmitter is shown in Fig. 8.15. Since all 
of the signals up to ports A and B are differential, each band incorporates a differential 
to single-ended (D/SE) converter, applying the result to an output buffer. The trans- 
mitter requires external power amplifiers to deliver the high power levels specified by 
GSM and DCS 1800. 
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Figure 8.15 Dual-band transmitter architecture. 




Figure 8.16 (a) First up-conversion modulator, (b) implementation of load inductors. 



8.5 TRANSMITTER BUILDING BLOCKS 

In this section, we describe the transistor-level implementation of each building block, 
emphasizing the design constraints imposed by the architecture. The circuit topologies 
are identical for both bands, but device dimensions and bias currents are chosen to 
optimize the performance of each. 

8.5.1 First up-conversion 

The 450 MHz up-conversion modulator consists of two Gilbert cell mixers whose out- 
puts are added in the current domain. Shown in Fig. 8.16, the circuit utilizes resis- 
tive source degeneration, thereby improving the linearity in the baseband port of each 
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Figure 8.17 Simulated spectra of the two terms in Eq. 8.5 (Horiz. scale normalized to 
bit rate, vert, scale: 5dB/div). 



mixer. Two 100 nH inductors convert the output current to voltage. To minimize the 
area occupied by each inductor, a stack of three spiral structures made of three metal 
layers (Fig. 8.16(b)) is used [1], reducing the area by approximately a factor of 8 [11]. 
Since the polysilicon connection and the bottom spiral suffer from substantial parasitic 
capacitance to the substrate, this node is connected to the supply voltage, increasing 
the self-resonance frequency of the inductor. The quality factor of the inductor is 
estimated to be about 4 and the self-resonance frequency is about 600 MHz. 

Why must the baseband ports be linearized? Let us return to Eq. 8.3 with the 
assumption that cos0 and sin# experience third-order distortion. The resulting IF 
signal can then be expressed as 

*GMSk( 0 = Acos<y c r[cos0 + acos(30)] - Asina) c t[sinO + asin(30)], (8.4) 

where a represents the amount of third-order nonlinearity. Grouping the terms in 
Eq. 8.4, we obtain 



* gmsk (0 = Aco%[w c t + Ko I XBB(t)*h(t)dt] 

J f (8.5) 

+aAcos[<w c f + 3Ao / *bb (t)*h(t)dt]. 

Equation 8.5 reveals that third-order distortion gives rise to a component centered 
around co c , but with a modulation index three times that of the ideal GMSK signal. 
Invoking Carson’s rule, we postulate that the second term occupies roughly three times 
the bandwidth, raising the power transmitted in adjacent channels. Fig. 8.17 shows 
the simulated spectra of the two components in Eq. 8.5 with a = 1, indicating that 
the unwanted signal indeed consumes a wider band. For this reason, as depicted in 
Fig. 8.18, a must be small enough such that the transmission mask is not violated. 
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Figure 8.18 Effect of harmonic distortion at baseband ports. 




Figure 8.1 9 Voltage-to-current converter with output switching. 



In this design, the resistive degeneration and tail currents are chosen so as to ensure 
a % 0.01 with a 0.5 V pp baseband input, yielding a third-order component 40 dB below 
the desired signal. 

8.5.2 SSB modulator 

The signals generated at nodes X and Y in Fig. 8.16 must be “routed” to one of the 
SSB modulators according to the band select command. As illustrated in Fig. 8.19, the 
routing is performed in the current domain to minimize signal loss due to the addition 
of the switches. Capacitively coupled to the output of the 450 MHz up-converter, 
the voltage-to-current converter employs grounded-source input devices to save the 
voltage headroom otherwise consumed by a tail current source. The bias current of 
the circuit is defined by Mo and 1q. Note that S 1 -S 4 operate in deep triode region, 
sustaining a small voltage drop. Also, the linearity of this and subsequent stages is not 
critical, because GMSK signals display a constant envelope and are quite insensitive 
to spectral regrowth [4]. 

With the quadrature phases of the IF signal available in the current domain, SSB 
mixing assumes a simple topology. Shown in Fig. 8.20, the circuit senses the differen- 
tial current signals routed from each 450 MHz up-converter, performs mixing with the 
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Figure 8.21 Differential to single-ended conversion using (a) tuned current mirror, (b) 
negative resistance generator. 



second LO, adds the resulting currents with proper polarity, and converts the output to 
single-ended form. 



8.5.3 Differential to Single-Ended Converter 

In order to achieve a reasonable gain, it is desirable to employ tuning in the D/SE 
converter. Fig. 8.21 depicts two realizations of such a circuit. In Fig. 8.21(a), a current 
mirror together with two inductors creates resonance at nodes E and F, reducing the 
effect of device capacitances. The difficulty here is the large gate-source capacitance 
of the two PMOS devices, mandating a small value for L i and hence a low conversion 
gain. Fig. 8.2 1(b) presents an alternative topology where a PMOS device introduces a 
negative resistance in parallel with a floating inductor. It can be shown that 



ry O •'* 1 , 

CgC/rs^ Cgs CfS 



where Ce and Cf denote the total capacitance at nodes E and F, respectively [4], As 
a compromise between margin to oscillation and boost in gain, the negative resistance, 
~8m\/(C eC f(^)i is chosen so as to increase the Q of the inductor by approximately a 
factor of two. Note that Li can assume a relatively large value because it sees Ce and 
Cf in series. In this design, the signal is sensed at F because this port exhibits a lower 
output impedance. Simulations indicate that the topology of Fig. 8.21(b) provides 
about three times the voltage gain of the circuit in Fig. 8.21(a). 
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8.5.4 Output buffer 




Figure 8.22 Output buffer. 



The output buffer is shown in Fig. 8.22. Two common-source stages, M\ and M 2 , 
boost the signal level, driving the 50 £2 impedance of the external instrumentation. The 
bias current of M 1 is defined by /1 and Mo and that of M2 by / 2 and M3. Neglecting 
the AC drop across the inductor, we have Vcs3 + VcSi = VGS5 + VgS 2 ', that is, 1 02 
can be ratioed with respect to I 2 . 

8.6 CONCLUSION 

The design of dual-band transceivers poses many challenges in terms of frequency 
planning and circuit building blocks. An image-reject receiver and a two-step transmit- 
ter architecture have been introduced that provide dual-band operation with 450 MHz 
and 1350 MHz LO frequencies. Also, circuit techniques for low-noise amplification, 
down-conversion mixing, generation of the quadrature phases of the transmit IF signal, 
and differential to single-ended conversion have been presented. 
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9.1 INTRODUCTION 

Digital wireless systems such as GSM are in wide use worldwide and will continue to 
expand and replace the remaining analog systems at an increasing rate. The technical 
factors that have contributed to this success are fundamental capabilities such as easy 
use of SSB (single-side-band) signals, efficient channel multiplexing (time or code 
division), increased capacity through compression, resilience against interference and 
noise, and secure transmission. In addition, the development of relatively inexpensive 
and low-power wireless transceivers has triggered a great proliferation of consumer 
portable units. Several key technologies are enabling the successful design and manu- 
facture of these devices. Examples include GHz-band Si RF ICs, mixed-signal CMOS 
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VLSI ICs, high-quality RF and IF passives, inexpensive plastic packaging, and DSP- 
based processing. 

The block diagram of a typical modem transceiver is shown in Fig. 9.1. It consists 




audio 

data 

keypad 



etc. 



Figure 9.1 A typical wireless transceiver. 



of analog RF, IF, and baseband sections and a digital section. The analog/digital inter- 
face (i.e. A/D and D/A converters) is placed at baseband, a choice dictated primarily 
by cost and power dissipation considerations. Despite the usual preference for “digital 
whenever possible” designs, it is still to be shown that moving the analog/digital inter- 
face away from DC and closer to the antenna results in superior transceivers for hand 
held applications, for which cost and power dissipation are essential design specifica- 
tions. The hope is that, in the near future, advances in circuit fabrication technology 
or new radio design techniques will lead to an increased digitization of the transceiver 
implementation. In turn, this will bring more robust products with increased reliability 
and additional programmable features. 

Assuming that IF sampling will eventually prevail in portable applications as a step 
forward in the “more digital” design direction, this paper presents some basic issues 
and tradeoffs of this approach. Similar aspects have been discussed in other places 
such as in [1-3]. The emphasis will be on the receive path. It is shown that a major 
limitation in moving the A/D conversion closer to the antenna comes from the filtering 
requirements of the front end, a severe condition for inexpensive IC designs. However, 
contrary to a common belief, IF-sampled receivers do not require more stringent os- 
cillator jitter specifications than conventional receivers. Band-pass sigma-delta A/D 
converters with noise notch shaping at one quarter of the sampling frequency are well 
suited for IF sampling applications. In order to explain these facts, it is necessary to 
review the key aspects of transceiver architectures and design methods. 

9.2 INTEGRATED TRANSCEIVER CONVENTIONAL WISDOM 

A successful GSM-quality transceiver front-end is shown in Fig. 9.2. The receiver path 
consists of a single IF superheterodyne tuner and an I/Q IF-to-baseband direct-conver- 
sion stage. The transmitter includes a direct modulator with offset. This architecture 
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uses only two oscillators and allows frequency plans with relatively few spurious sig- 
nals in the RF and IF bands. A high level of integration is possible, but only in the 
presence of several high-quality off-chip filters. The dividing line between analog and 
digital is at baseband, where extensive digital processing is applied. 




Figure 9.2 A GSM-quality Integrated transceiver architecture. 



Focusing on the receiver, the typical processing of the worst-case antenna signal 
is illustrated in Figures 9.3a) and 9.3b). Using analog filtering, AGC, and frequency 
translation, a narrow frequency band of several channel widths centered on the desired 
channel is separated from the RF band and shifted to DC. Here, after more analog filter- 
ing and amplification, the signal is digitized. The final channel selection, equalization, 
and demodulation are done in the digital domain. The exact amount of processing at 
each stage of the receiver is carefully selected to optimize the overall transceiver per- 
formance, power dissipation, and cost. For example, it is not economical to separate 
the desired channel totally at IF since the proper IF filters would be too expensive. 

Next, the most important analog design considerations of the receiver will be dis- 
cussed briefly. The backbone of the receiver’s analog part is a cascade of filter-and- 
amplify sections. In Fig. 9.4 we see a common design tradeoff between filter atten- 
uation and amplifier gain in these sections. The input of each filter or gain block is 
allowed to contain a certain maximum level of out-of-channel interference according 
to the signal-swing capabilities of the actual circuits. The amount of post filter am- 
plification is designed such that the maximum interference at the filter output and the 
maximum interference at its input are equal. In this way the circuit linearity require- 
ments are uniform from section to section. Of course, since the various blocks operate 
at different frequencies and bandwidths, this is only a first-order rule. 

The frequency-conversion mixers connecting the filter-and-amplify sections are di- 
rectly responsible for several possible sources of receiver errors. The image and IF 
side-band generation problems are shown in Fig. 9.5 for the RF section. While the IF 
side bands are very far from the actual IF and can be easily removed, the rejection of 
the image frequency is a major design consideration. Active image rejection is pos- 
sible but limited in effectiveness (30-40 dB maximum) and power consuming. The 
most common practice is to use passive filters, as shown in Fig. 9.6. The nonlinear 
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Figure 9.3 a) Receiver front-end functions, b) receiver back-end functions. 
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Figure 9.4 Filtering-gain tradeoff (qualitative schematic). 



character of the actual mixer creates a further “image” problem due to mixing of LO 
harmonics with input signal harmonics. Fig. 9.7 demonstrates the generation of such 
spurious responses at the first IF. Therefore, the choice of IF frequencies is of utmost 
importance and involves compromises between filtering complexity and mixer spuri- 
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Figure 9.5 Mixer image and side bands problems. 
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Figure 9.7 Mixer spurious IF response. 



ous characteristics. Furthermore, the presence of DC offsets and practical mismatches 
limit the performance of the final I/Q conversion step. 

Another important aspect in receiver design is the reciprocal mixing effect. Fig. 9.8 
shows how the LO noise at offset frequencies is mixed to IF by large RF interfering 
signals (blockers). This is an important factor in the calculations of oscillator noise 
requirements and receiver sensitivity in the presence of many channels. 

9.3 IF SAMPLING: CONDITIONS AND LIMITATIONS 

The natural evolution of the receiver from Fig. 9.2 into an IF-sampled architecture is 
shown in Fig. 9.9. A first superficial examination would show that it is reasonable 
to expect this receiver to maintain all advantages of the original design with the addi- 
tional features of perfect I/Q matching, no baseband DC offsets, and requiring a single 
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Figure 9.8 Reciprocal mixing. 



A/D converter. These attractive characteristics are traded for higher speed in the A/D 
conversion operation and the addition of digital processing. It appears that, ultimately, 
the power dissipation required in actual implementations will determine the success 
of this approach. The power dissipation depends on circuits used and will improve 
with the scaling of IC technologies. However, even without assuming any specific IC 
capabilities, we can make several important observations about IF sampling including 
the realization that the previous examination of Fig. 9.9 is overly simplified. 




Figure 9.9 IF sampling receiver architecture. 
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The mixing of IF to DC, as shown in Fig. 9.2, has the important advantage of not 
requiring image rejection of any interfering signals. In addition, IF mixers and base- 
band anti-aliasing filters are relatively easy to design. When a sampler replaces these 
mixers, the IF filter is required to provide substantially more stop-band attenuation to 
avoid aliasing of blockers. This affects the design and the cost of the transceiver. The 
sampler implementation, which is more difficult than that of the mixer, has similar 
consequences. We have fundamentally conflicting requirements: a high IF makes anti- 
aliasing and RF image rejection filtering easy and the sampler design difficult, and 
vice versa. 

For a given sampling frequency fs, the maximum IF bandwidth is obtained for IF 
at fs/ 4. This is shown in Fig. 9.10. Since the first IF is usually 70 MHz or more, 
4 x -sampling circuits are difficult to implement. A better possibility is to apply under- 
sampling. If the receiver’s first IF is placed at an appropriate integer multiple of /s/4, 
where fs is a reasonably low sampling frequency, we obtain the same signal spectrum 
as in Fig. 9.10. 
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Figure 9.10 4X sampling of IF signals. 



This is equivalent to down-converting the signal to fs / 4 and then sampling. Nat- 
urally, the anti-aliasing requirements of the IF filter become increasingly difficult to 
meet at higher under- sampling ratios, as illustrated in Fig. 9.11. 




An additional important constraint is related to the relationship between the IF and 
the symbol rate. In classical architectures (sampling at DC), the two quantities are 
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independent, but in the IF sampling case, they are related by the fs /4 condition. This 
assumes an integer number of samples per symbol for easy digital demodulation. For 
example, a GSM IF-sampled receiver should use an appropriate multiple of 13/4 MHz 
for the IF and a multiple of 13 MHz for fs. A 52 MHz sampling rate and a 65, 91, or 
117 MHz IF would fit this rule. 



Sampler Mixer 




Oscillator Oscillator 



Figure 9.12 Sampler versus mixer. 



It is instructive to analyze further the potential limitations of under-sampling. In 
particular, the oscillator jitter is commonly alleged to be a trouble spot for this tech- 
nique. As illustrated in Fig. 9.12, we will do a comparison with the conventional 
mixer. One can calculate the effect of oscillator jitter with the help of Figures 9.13 
and 9.14. 
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Figure 9.1 3 Modeling sampling-oscillator noise. 



The output of the sampler S (kT), produced in reality by the noisy sampling of a 
pure input signal S(t), can be regarded as the noiseless sampling of an effectively jit- 
tery input S{t — r). Fig. 9.13 shows this equivalence, with r being a low-pass random 
process describing the oscillator jitter. If this process has the spectral density function 
Sir (A/), the oscillator and effective input SSCRs (signal-sideband-to-carrier ratios) 
are related as in Fig. 9.14. This is easily derived by observing that the variation SO(t) 
of the oscillator output 0(0 = A 0 sin(a> 0 / + <j> 0 ) is proportional to its time derivative 
A 0 o) 0 cos(a) 0 t + 4> 0 ) and r — St while the variation SS(t) of the effective input signal 
5(f) = A s sin(a>jt + <ps) is proportional to A s oj s cos (o) s t + <f > s ) and the same St. Elimi- 
nating this variable between the 50(0 and SS(0 expressions, the relationship between 
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the actual oscillator noise spectrum and the effective input signal spectrum is obtained. 
Graphically this dependence is represented in Fig. 9.15. 



NoIm (dB) Noise (dB) 




Figure 9.15 Effective input noise spectra of sampler. 



Applying the previous development to the RF input case in Fig. 9.16 shows clearly 
that sampling large blockers superimposes substantial oscillator noise over the desired 
RF signal. 

This is similar to reciprocal mixing in conventional mixers. Furthermore, when we 
compare the two effects in the equivalent frequency plans of Fig. 9.17 for the conven- 
tional and the IF-sampled cases, the oscillator jitter requirements become identical, as 
seen in Fig. 9.18. 

An interesting observation is that under-sampling effectively acts as down-conver- 
sion to a second IF at /s/4. In this view, the anti-aliasing conditions of the first IF 
filter become “image rejection” conditions for the down-conversion to the second IF. 
This interpretation shows intuitively why under-sampling and mixing require identical 
oscillator specifications. 

Comparing the intrinsic noise performance of the two circuits in Fig. 9.12, we ob- 
serve that input noise folding is present in both cases, albeit through different (but 
related) mechanisms. The IF filters reduce these effects. A more severe sampler prob- 
lem absent from the mixer is the aliasing of self thermal noise. The in-channel portion 
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Figure 9.16 Sampling blockers. 




Figure 9.17 Equivalent IF sampling and conventional frequency plans. 



of this noise for a given sampling frequency is AT/C divided by the over-sampling 
ratio with respect to the channel width. A potentially serious practical limitation for 
large under-sampling ratios is the use of small sampling capacitors dictated by circuit 
bandwidth requirements. Fortunately, in a well designed receiver, 70-80 dB of gain 
is present between the antenna and the sampler, reducing the AT/C noise effect to 
insignificant levels. This is similar to conventional sampling at the baseband. 

9.4 BAND-PASS A/D CONVERSION 

The discussions regarding Fig. 9.10 and 9.17 point out that band-pass sigma-delta 
A/D conversion is naturally suited for IF sampling. The circuit resources for obtaining 
high dynamic range are used only within the desired channel. The interfering input 
signals and the A/D quantization noise are filtered out in the decimator, whose digital 
signal processing capabilities mimic the conventional analog approach. The structure 
of Fig. 9.19 is identical to that of the discarded block in Fig. 9.9. 

The block diagram of a conventional band-pass sigma-delta modulator is shown in 
Fig. 9.20. It is important to verify the performance of this circuit in the presence of 
blockers. Computer simulations, which have been checked with extensive measure- 
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Figure 9.18 Equivalence of blocker sampling and reciprocal mixing. 
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Figure 9.19 Decimator functions. 



ments, are shown in Figures 9.21 and 9.22. The circuit behavior in Fig. 9.21 in re- 
sponse to increasingly stronger blockers is easily anticipated. The non-trivial increase 
in blocker insensitivity at high offset frequencies shown in Fig. 9.22 is unexpected but 
welcome. Further investigations are necessary to fully explain this property. 

9.5 CONCLUSIONS 

Designing IF-sampled transceivers is no simpler than designing conventional architec- 
tures, despite the promise and advantages of a more digital implementation. Careful 
tradeoffs between radio-system and circuit issues have to be made in the context of 
cost and power dissipation budgets. For narrow-band digital wireless systems, such as 
GSM, it is likely that IF-sampled portable transceivers will become practical. Based 
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Figure 9.20 Conventional band-pass sigma-delta modulator. 
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Figure 9.21 S/N ratio of A/D converter with 1 .6 MHz-offset blockers. 



on the considerations in this paper, it is reasonable to expect that such a solution will 
use a moderate level of under-sampling and passive IF filters. 
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Abstract: An overview of different transmit architectures is given. The major scope 
are the concepts used in the GSM system. A comparison with respect to integration 
level and system cost is given. The modulation loop concept commonly used today is 
discussed in detail, with respect to implementation details and trade-offs. 

10.1 INTRODUCTION 

The rapidly growing mobile communication market requires increasing the efforts to 
achieve higher integration levels. The ultimate goal is the integration of a complete 
mobile on a single die — preferably on a CMOS die. The example given in Fig. 10.1 
is still a dream. Today, we are far away from this goal. RF CMOS is more or less 
a research topic, although there are already a lot of publications dealing with this 
subject [1-6]. A demonstration of a fully functional GSM-like high-end system is not 
yet in sight. This would require a circuit which achieves the system specifications 
with respect to process, temperature and supply variations. Moreover, this target has 
to be met in a real environment, including on-chip bias circuitry. For these reasons, we 
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are presently far away from integrating the RF parts of high-end systems on a single 
CMOS die. 




It has not been demonstrated yet how to integrate large digital blocks along with 
analog RF blocks. Baseband digital signal processing typically requires a large die 
area even in advanced technologies. It is therefore desirable to follow the digital shrink 
path as fast as possible. From this point of view, the single-chip mobile might remain 
too expensive as long as reasonable cost reduction can be achieved by shrinking the 
digital functions. The design of the analog RF portion surely is the bottleneck in the 
product cycle. Therefore research for a single-chip mobile might be postponed until 
the dies are pad limited, which will provide the die area required for the RF blocks 
without additional cost. However, the system performance has to be achieved under a 
lot of constraints: 

■ reduced supply voltage might result in the need for very high VCO gain and in 
increased current consumption 

■ substrate noise 

■ low substrate resistance 

■ high process tolerances 

Due to the fact that the industry has to deliver products at optimum cost, perfor- 
mance and size, and at a certain point in time, we see an evolutionary development. 
That means the industry is relying on proven technologies, which are going to allow 
volume shipping at a low risk with respect to time-to-market. Therefore, architectures 
as given in Fig. 10.2 will be used in the next integration step. 

10.2 GSM TRANSMITTER REQUIREMENTS 

In order to derive the requirements of the transmitter building blocks, the GSM sys- 
tem specifications have been reviewed and translated into requirements related to the 
power spectral density. Fig. 10.3 is generated from [7], Sections 4.2.1 and 4.3.3, which 











TRANSMITTER CONCEPTS, INTEGRATION AND DESIGN TRADE-OFFS 



143 




GSM Spectral Mask ITX = 91 SMH; 




I ,5 



GSM Spectral Mask - B1SMHI 



GSM 05.06 4.2.1 - 
GSM 05 06 4 3.3a 
GSM 05 05 4 3 3b 
GSM 05 06 4.3.3 



I 



Figure 10.3 GSM unwanted emission requirements. 
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Figure 10.4 Spurious emissions. 



are dealing with emissions due to modulation and unwanted emissions. The worst 
case occurs while transmitting on the highest channel at 915 MHz, due to the spurious 
emission requirements in the mobile RX band 935 MHz to 960 MHz, given in Sec- 
tion 4.3.3 of [7]. In order to avoid interference with the reception of a nearby mobile, 
the maximum allowed power in a 100 kHz bandwidth is -79 dBm. Considering power 
amplifiers of class 4 — which means 2 W or 33 dBm — this results in a requirement 
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of -162dBc/Hz at 20 MHz offset as shown in Fig. 10.4. For DCS 1800 mobiles, this 
specification is relaxed to -7 1 dBm. 




Figure 10.5 Basic GMSK TX Architectures, a) Direct modulation, b) IF modulation 
with up-conversion mixing, c) modulation loop. 



10.3 BASIC GMSK TX ARCHITECTURES 

The basic architectures shown in Fig. 10.5 are commonly used today. The aim of this 
section is to give a comprehensive review of these architectures including a discus- 
sion of the advantages and disadvantages in terms of cost and implementation issues. 
Emphasis will be laid on spurious emissions with respect to nonlinearities. A qualita- 
tive study of the effects will be presented, helping to understand the design trade-offs 
which occur in the implementation of the different architectures. The power amplifier 
will not be included in this discussion. 

10.3. 1 Direct modulation 

In this section, the basic behaviour of a direct-modulation system as shown in Fig. 10.6 
will be studied. Most of the basic characteristics discussed here will also apply to the 
other architectures using the basic vector modulator at a different frequency. The first 
target of the modulator is to generate a GMSK modulation which is compliant with the 
system specification [7], Moreover, the given modulation mask sub-clause 4.2. lof [7] 
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Figure 10.6 Direct-modulation system: Spectrum due to modulation and wideband 
noise, a) Influence of digital resolution, b) Influence of lowpass filter. 



has to be fulfilled. This means that the unwanted power of a mobile unit measured at 
30 kHz offsets between 400 kHz and 1800 kHz has to be lower than 60 dBc, where the 
carrier power is also measured inside a bandwidth of 30 kHz. Including a reduction 
of 8.5 dB to take into account the reduced bandwidth used to measure the carrier, the 
requirement will be -1 13.3 dBc/Hz. The digital part has to deliver a resolution which 
is sufficient to achieve this requirement, because the anti-aliasing lowpass filters after 
the DACs cannot provide an additional attenuation below 1 MHz (see Fig. 10.6). The 
lower limit for the lowpass cut-off frequency is given by the required phase accuracy. 
In order to improve the performance or to reduce the ADC resolution, compensation 
in the digital domain might be used. The LO-synthesiser causes additional unwanted 
emissions at point C of Fig. 10.6 in the form of reference spurs, which have to be 
considered as well. 

The modulator, as shown in the grey box of Fig. 10.7, will have a wideband noise 
floor of about -140 dBc/Hz. In the TX chain after the modulator, no further narrow- 
band filtering is possible, which is due to the fact that the RF-LO is used to adjust the 
TX channel. Thus, the noise floor of the modulator itself will dominate the unwanted 
emissions above 2 MHz. 

Using a vector modulator as single-side-band modulator, a typical measurement 
result is given in Fig. 10.8a). Part b) of the figure shows simulation results of SSB 
modulation and a GMSK modulation. An ideal modulator would generate only a 
single tone for the SSB case, whereas the real modulator will generate a lot of different 
components. The image rejection will be limited by the accuracy of the phase shifter. 
There might be LO leakage as well due to DC offsets. The other tones are generated 
by the nonlinearities of the modulator. They will result in a wider spectrum as shown 
in the simulation part b) of Fig. 10.8. 
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Figure 10.7 Influence of modulator noise floor. 




Figure 10.8 Influence of nonlinearity within the modulator: a) SSB measurement, 
b) simulation SSB and GMSK. 



Therefore, nonlinearity is of major concern in the implementation of a vector mod- 
ulator. In order to understand the basic influence of the different nonlinearities, the 
first step is to separate them. 

Considering only limiting amplifiers in the input stage of the modulator as shown 
in Fig. 10.9 at first results in the tones given in the same figure. Assuming that the 
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Figure 10.9 Influence of nonlinearity within the modulator input. 



input nonlinearity is hard limiting, the input signals are square waves with a phase 
shift of 90 degree. It can be seen from the Fourier coefficients that the phase of the 
harmonics in one of the input paths will change by 180 degree, e.g., the 3rd harmonics 
compared to the fundamental. This means for the 3rd harmonics the upper sideband 
will be rejected and the lower sideband located at /lo — 3 f m will occur. In Fig. 10.10, 
only nonlinearities after the LO phase shifter and in the output stage are considered. 

For reasons similar to the ones discussed previously, the input signal will be mod- 
ulated onto the lower sideband of the 3rd harmonics of the LO. That means a tone 
will occur at 3/lo — fm - Because of the output nonlinearity, an intermodulation prod- 
uct will be generated at — /lo + 3 f m , which is mirrored to the positive frequency 
/lo - 3 f m . 

Finally, taking the nonlinearity of the power amplifier into account, additional in- 
termodulation products will be generated as shown in Fig. 10.11. 

What does this really mean for the system? The nonlinearities will generate a wider 
GMSK spectrum. There are two ways to meet the system specifications: The first is to 
make the modulator as linear as possible, thereby avoiding the wider spectrum. This 
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Figure 1 0.1 0 Influence of nonlinearity within the modulator LO path and output stage: 
generated SSB tones. 




Figure 1 0.1 1 Influence of nonlinearity within the power amplifier. 



results in an increase in power consumption. The second way is to make the power 
amplifier more linear in order to reduce the intermodulation. However, the result with 
respect to power is even worse. Accordingly, this is surely not the way to go for a 
battery-powered mobile. 

Another quite important requirement is the wideband noise floor at the modulator 
output. As shown in Fig. 10.3, the TX in band requirements are -133 dBc/Hz below 
20 MHz offset and -138 dBc/Hz above. In order to achieve these specifications, an 
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output power of about 0 dBm is required, which is hard to achieve when using a CMOS 
implementation. 

The direct-up-conversion architecture shown in Fig. 10.6 has the drawback that the 
RF-VCO operates at the output frequency. Therefore, depending on the quality of 
shielding, a re-modulation may occur. This will result in a phase error which might be 
too high. In order to avoid this problem, almost all commercially available implemen- 
tations use an offset mixing scheme in order to generate the desired output frequency. 

10.3.2 IF modulation 
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Figure 10.12 IF modulation. 



In the last section, the basic characteristics of the vector modulator have been dis- 
cussed intensively in combination with direct up-conversion architecture. Most of the 
issues apply as well to the IF modulation architecture given in Fig. 10.12. The IF mod- 
ulation has a major advantage compared to the direct up-conversion. First, the re-mod- 
ulation problem does not occur and the isolation requirements are relaxed. Second, 
a narrow IF-SAW filter might be used to reduce the unwanted emissions. Moreover, 
the architecture is well suited for dual-band applications (GSM in combination with 
DCS1800). The IF-modulator will be used for both systems at a common IF, whereas 
two up-conversion mixers are used to generated the different TX frequency bands. 

As described in Section 10.2, a noise floor of -162dBc/Hz has to be realized 
in a GSM system. The direct-up-conversion architecture requires a duplexer in the 
front-end because the noise floor generated by the modulator is quite high. The De- 
modulation system comes up with a lower noise floor, which might be in the range 
of -160dBc/Hz. In order to guarantee the performance in high-volume production, 
something like -165 dBc/Hz would be needed. Therefore, the IF-modulation archi- 
tecture cannot avoid the expensive duplexer. During the last few years, the modulation 
loop architecture has reached maturity and is the only system approach which avoids 
the duplexer. 

10.3.3 Modulation loop 

As already mentioned, the modulation-loop-based transmitter of Fig. 10.13 has be- 
come quite popular. The first reason is the fact that no duplexer is needed. A second 








150 CIRCUITS AND SYSTEMS FOR WIRELESS COMMUNICATIONS 




Figure 10.13 Modulation-loop-based transmitter. 



major advantage is the dual-band capability of the architecture. The modulation loop 
is more or less a PLL in which the reference of the phase detector is modulated. In 
the feedback path of the loop, a down-conversion mixer is used to provide the down- 
converted VCO signal to the phase detector. The RF-VCO provides the LO for the 
down-conversion mixer, synthesised by a conventional PLL in order to adjust the de- 
sired channel. 



TX-IF 




Figure 10.14 Modulation loop: modulate reference path. 



Due to the high reference frequency of 100 MHz or higher (refer to Fig. 10.14) 
there are several hundred phase comparisons for one transmitted bit. Therefore the 
loop is able to copy the phase modulation of the reference path onto the output of 
the power VCO, which is within the loop. The GMSK modulation used in GSM-like 
systems is a pure phase modulation and does not exhibit any amplitude modulation. 
The power-VCO provides a very low noise floor at 20 MHz offset — typically less than 
-165 dBc/Hz. This VCO is realized as a module and is capable of driving the power 
amplifier directly without additional noisy buffer stages. 

The configuration can be seen as a transformation of an RF bandpass into a lowpass 
filter. 

In Fig. 10.15, the principal block diagram of an up-conversion loop with modula- 
tion in the feedback loop is given along with its frequency plan. This configuration 
requires that the output of the vector modulator within the feedback path does not 
exhibit any phase modulation. Accordingly, the VCO must have phase modulation 
which is cancelled by the vector modulator in order to fulfill the requirements of the 
phase-locked loop. 
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Figure 10.15 Modulation loop: modulate feedback path. 
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Figure 10.16 Modulation loop: Simulation schematic for loop dynamics. 



The GSM system requires an RMS phase accuracy of under 5 degrees, which calls 
for a high cut-off frequency of the loop filter. On the other hand, only the VCO wide- 
band noise should be present at 20 MHz frequency offset. In Fig. 10.16, an APLAC 
simulation setup [8] is given which is used to study the different noise contributions 
within the modulation loop. The different noise sources, the corresponding transfer 
functions and the resulting output noise contributions with respect to the major build- 
ing blocks of Fig. 10.15 are given in Fig. 10.17 and Fig. 10.18. 

Outside the modulation bandwidth and below 5 MHz, the vector modulator is the 
dominant noise source, where the noise of the modulation itself is dominated by the 
digital noise of the I-Q signal generation. Above 10 MHz, the noise is dominated 
by the noise of the power VCO. The phase-noise requirements of the IF-LO and the 
RF-LO are relaxed due to the attenuation of the transfer function within the loop. 
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Upconversion Loop Noise Contributions 




Figure 10.17 Overall noise and different contributions within the modulation loop. 




Figure 1 0.1 8 Noise contributions and transfer functions of the major building blocks. 



10.4 IMPLEMENTATION RESULTS 

A major trade-off in designing the loop transfer function is to keep the bandwidth 
high enough to keep the phase error low while achieving the -162 dBc/Hz at 20 MHz 
offset. As shown in Fig. 10.19 and Fig. 10.20, a transmitter based on the PMB 2251 
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Figure 10.19 Measured output spectrum of a PMB 2251 modulation loop. 
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Figure 10.20 Measured phase noise within the RX band at 20 MHz offset. 



modulation is able to achieve this goal. Detailed information on the application circuit 
and the measurement setup can be found in [9]. 

Fig. 10.21 shows the results of an SSB-modulation generated by the PMB 2251 
modulation loop. Due to the limiting behaviour of the type-4 PFD used in this appli- 
cation, the unwanted tones are symmetrical with respect to the wanted signal. The 
PFD removes any AM components, meaning that only a PM component remains. The 
IM3 components are quite low at -60dBc. Therefore, the modulation mask can be 
achieved easily by using a power amplifier with relaxed specifications. The CMOS 
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Figure 10.21 Measured single-side-band spectrum of a PMB 2251 modulation loop. 



DCS1800 direct-conversion modulator reported in [1] exhibits an IM3 suppression of 
35 dB. 

10.5 CONCLUSIONS AND FUTURE REQUIREMENTS 

The increasing number of cellular subscribers requires the implementation of opti- 
mised architectures of multi-band systems combining at least two standards out of 
GSM, DCS 1800 and DCS 1900. Therefore, problems like choosing the best possible 
frequency plan or reducing the number of VCOs may dominate the R&D for at least 
the next generation. Another challenge will be the implementation of multi-mode sys- 
tem like DCS 1800/DECT or GSM/UMTS. In these combinations, we have to cope 
with different data rates and modulation methods. This will have a significant influ- 
ence on the implementation and may require a step back in the integration level. 

Another issue is the reduction of size and weight of the cellular phone. This will 
require improved batteries along with reduced power consumption. Concerning the 
talk time, the efficiency of the PA has to be improved in some way. The standby time 
is important to the user as well, meaning that the power consumption of the receive 
path has to be reduced. 

As shown for the TX implementation in this Chapter, the overall system design 
requires an optimum balance between a lot of different effects. Taking the presented 
problems into account, an increased integration level may result in higher system cost. 
However, from the commercial point of view, system cost is the major driving force. It 
really does not matter which technology might be used: digital CMOS only or digital 
CMOS, analog CMOS, BiCMOS or GaAs. A minimum system cost with respect 
to a certain point in time has to be achieved. Therefore, issues like yield, design 
cycles and availability of deep sub-micron technologies have to be considered. The 
final conclusion from the authors’ point of view is that reasonable integration steps 
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are required depending on the system requirements, time schedule and the available 
technology. 
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Abstract: Wireless telecommunications will enter a new era in the coming years. 
New systems and new customer needs are already introducing vast challenges, in par- 
ticular for the implementation of the terminals. The RF section is a major contributor to 
the size and cost of the terminal. Consequently, the winning terminal implementations 
will be those with the best RF section. 

None of the implementation technologies alone seem to provide the full answer to 
the RF challenges. Instead, the winning RF will be achieved by careful joint optimisa- 
tion of the overall technology portfolio with the proper radio architecture. 

11.1 INTRODUCTION 

From the early 1980s when the first generation of analog systems NMT, AMPS, and 
TACS were first introduced, the cellular phone market has exploded to be one of the 
major consumer product markets. By the end of 1997 there were close to 207 million 
cellular subscribers worldwide. There was an increase of 70 million during 1997 
alone. According to some predictions the number of cellular subscribers will exceed 
one billion by 2005 [Nokia Press Release, September 1998]. As a world record for the 
cellular phone penetration, Finland has just exceeded 50% of the population (August 
1998). Currently, digital narrow-band systems led by GSM continue to conquer the 
world. In the near future, however, the voice-based second-generation cellular systems 
will be complemented by new systems. The main interest in Europe is toward the 
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WCDMA radio access technology that ETSI selected in January 1998 to be the basis 
for the Universal Mobile Telecommunication Service (UMTS), also referred to as the 
third-generation cellular system. The driving force behind the WCDMA system is the 
increased need for data capacity, particularly for mobile multimedia applications. 

Furthermore, there is a growing interest toward wireless data-communication ap- 
plications. The “Bluetooth” concept, 1 working at 2.4 GHz, will provide low capac- 
ity short range wireless data transfer for various applications and accessories. The 
Wireless LAN systems at 2.4 and 5 GHz will increase the wireless data transmission 
capacity in office environments. 

Although many of the new systems are targeted to be global, the trend is toward 
higher diversity of co-existing systems. As a result, terminals offering access to sev- 
eral systems are needed, and the integration becomes extremely challenging. In par- 
ticular, the RF section — being a major contributor to the wireless terminal size and 
cost — faces requirements that push the technologies to their limits. 

The objective of this article is to discuss the main trends affecting the RF implemen- 
tation of tomorrow’s wireless terminal: the emerging wide-band systems, upcoming 
multi-system terminals, continuous miniaturisation, and cost-reduction. The first two 
items are emerging new trends, whereas the latter items are more evolutionary ones. 

1 1 .2 CAPACITY FOR WIRELESS MULTIMEDIA 

The wide-band wireless communication systems are finally moving forward from fea- 
sibility studies to serious development programs. The main advantage of the new 
systems, as advertised, are the high bit-rates ranging from hundreds of kbit/s up to 
2Mbit/s for cellular wide-band. In WLAN systems peak bit-rates could even exceed 
20Mbit/s. The result of the high capacity is the increasing need for DSP computational 
power. In terminals, the requirements for the wide-band data flow are increasing up to 
hundreds or thousands of MIPS. Fortunately, the increasingly complex DSP becomes 
feasible as the evolution of the sub-micron CMOS technology continues. 

One reason for the capacity being so high is the use of spectrum-efficient modula- 
tion methods. In order to meet the demanding specifications of the transmitted signal’s 
spectrum, the transmitter has to be highly linear. For example, the adjacent-channel 
power level must be in the order of -45 dBc for the wide-band cellular terminals. 
With linear power amplifiers, it is extremely difficult to achieve a sufficient battery ef- 
ficiency; typically, the battery efficiency falls into the region of 5 to 20 %. This is not 
satisfactory for cellular phone users who are used to relatively long talk times resulting 
from battery efficiencies above 60 %. 

The efficiencies can be boosted up with different transmitter linearisation schemes. 
Some of the most promising methods are local negative feedback, pre-distortion, and 
feed-forward linearisation, illustrated in Fig. 11.1. Using the local negative feedback 
is highly desirable, since it does not affect the radio architecture, and since it also 
makes it possible to integrate the linearisation circuitry with the power amplifier. 

For the pre -distortion, there are several alternative approaches: analog, digital fixed, 
and digital adaptive pre-distortion. The analog pre-distortion scheme is difficult to con- 
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Figure 11.1 Power amplifier linearisation concepts: (a) local negative feedback, 
(b) fixed analog pre-distortion, (c) adaptive digital pre-distortion, (d) feed-forward. 
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trol over the variations of temperature, supply voltage, and circuit parameters. With 
the fixed digital scheme, the system can be taught for different conditions; still, ma- 
jor variations of the nonlinear characteristics of the transistors are extremely difficult 
to control. An adaptive concept would be flexible for varying conditions, but it suf- 
fers from the overall complexity. The digital pre-distortion schemes introduce mod- 
ifications to the whole transmitter chain: wider bandwidths of the filters and better 
resolution of the D-to-A converters. 

For achieving the ultimately linear operation, the feed-forward linearisation has 
the highest potential. The feed-forward linearisation is truly adaptive to the different 
conditions. Still, the method suffers from the complicated control of the phase and 
amplitude characteristics of the two loops, and the power consumption can be unsatis- 
factorily high for hand-held terminals. 

An additional difficulty, in particular for the CDMA systems, is the large power 
control range, being in the order of 60 to 90 dB. Maintaining the high linearity over 
a wide range of power levels is a challenge for the power amplifier design. CDMA 
systems do not often operate at the maximum power level. Maintaining an adequate 
efficiency at moderate output powers is therefore an extremely important objective. 

The low efficiency combined with the increasing packaging density can lead to heat 
transfer problems. To overcome these problems, mechanical and thermal design have 
to be integrated with the electrical design flow of future terminals. 

Other new requirements of high bit-rate systems include fast-settling synthesisers 
for variable-capacity TDMA systems and high-speed A-to-D and D-to-A converters 
for the baseband section. 

11.3 MULTIPLE RADIOS IN ONE UNIT 

As the wide-band systems are coming, it is important to notice that the R&D for the 
second generation terminals (GSM, GSM1800/1900, IS- 136, IS-95, PDC ... ) contin- 
ues actively; the market is not showing any signs of saturation. Multiple systems will 
co-exist in the same geographical areas. Accordingly, there is a market need for termi- 
nals offering access to several systems. The non-compatibility of the different systems 
causes difficulties for the implementation. In the baseband section, core platforms for 
several radio access protocols can be implemented, whereas in the RF section it is 
more difficult to design core solutions cost-efficiently. 

The key task in implementing the RF section for a multi-system multi-band termi- 
nal is to minimise the number of parallel functions. Many of the active functions, 
such as variable-gain amplifiers and mixers, can be designed to be adaptive to differ- 
ent specifications, whereas passive circuits, such as filters and matching circuits, need 
to be selectable for each designated frequency band. High-frequency filter technolo- 
gies are clearly the key for finding implementation solutions for future miniaturised 
multi-mode terminals. An example RF architecture of a dual-mode terminal is shown 
in Fig. 11.2. 

None of the semiconductor technologies alone will provide an optimum platform 
for the multi-mode transceivers. The best and only way to find the winning RF solu- 
tion is to optimise the whole RF technology portfolio including the ICs, packaging, 
substrates, passives, and filters with the RF architectures for the systems selected for 
a multi-system product. BJT, BiCMOS, CMOS, SiGe, GaAs, SOI, and others will all 
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Figure 11.2 Example RF architecture of a dual-mode terminal. 



remain possible enabling technologies for the optimum RF solution, but not targets by 
themselves. The micro-electro-mechanical systems (MEMS) will also eventually find 
their way into wireless terminals. Possibly, MEMS could offer the needed tunability 
for the passive functions: filters and matching circuits. 

In addition to the component technologies, radio architectures more suitable for 
multi-mode operation can be applied. As a part of the overall architecture, the selec- 
tion of reference clock frequencies is of utmost importance. 

11.4 THE SMALLER THE BETTER 

There seems to be more market demand on ultimately small phones than first pre- 
dicted. Moreover, miniature phone engines are vital for the multi-mode terminals in 
which the electronics of two or more terminals are integrated into one product, since 
the new multi-mode terminals cannot be larger than current high-end single-mode cel- 
lular phones. 

Figure 11.3 shows the development of the volume of a typical GSM hand-portable. 
The corresponding component count in the RF section is shown in Fig. 11.4. The sig- 
nificant improvements in size and component count of the hand-held digital cellular 
phones have been achieved by increasing the level of integration in the phone. Cur- 
rently, all active functions are integrated, even in the RF section. On average, there are 
currently three RF ICs in a terminal. Simultaneously, there are approximately 100 pas- 
sive discrete components in the RF section, as can be seen in Fig. 1 1.4. These discrete 
components dominate the size; consequently, it is obvious that reducing the number of 
discrete components is much more important than integrating the remaining ICs into 
a single chip. 

In the evolutionary prediction illustrated in Fig. 11.3 and 11.4, the main improve- 
ments are made by applying new packaging and interconnection technologies. The 
old microwave hybrid technology is reborn with a slightly new appearance: custom 
ICs are assembled using chip-scale miniature packages, e.g. flip-chip, on a high-per- 
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Figure 11.3 Development of volume (in cubic centimetres) of a typical GSM hand- 
portable terminal. 



formance substrate with passive filters, interconnections, and external discrete compo- 
nents. 

More revolutionary improvements in the miniaturisation might be achieved with ad- 
vanced transceiver architectures shown in Fig. 11.5. The direct conversion architecture 
can be applied to some systems to remove the bulky IF filters. An ultimate goal could 
be an all-digital transceiver adaptive to all major wireless systems. In the all-digital 
transceiver, the sampling is carried out at the signal frequency. For several reasons, 
the approach is not yet feasible. The power consumption of the required Gbit/s signal 
processing and the high-resolution high-speed A-to-D conversion exceeds many times 
the acceptable level for portable devices. Furthermore, the stopband rejection specifi- 
cation for the remaining RF filter becomes extremely demanding: approximately 100 
dB for narrow-band TDMA systems. With today’s technology, this would lead to a 
filter having a physical size too large for hand-portable terminals. 

11.5 ... FOR THE LOWEST COST 

The prices of the cellular phones have dropped by tens of per cent annually over the last 
years. In order to maintain the competitive edge, the total cost of the product needs to 
be reduced. This trend will continue in the near future, and there is even more demand 
for cost reduction since the more complex — and hence more expensive — wide-band 
and multi-mode terminals are coming. It seems evident that customers are not willing 
to pay more for the new advanced terminals. 
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Figure 11.4 Development of component count in the RF section of a typical GSM 
hand-portable terminal. 



Fortunately, the technologies that enable the miniaturisation provide, for the most 
part, also cost-reduction. The key issue is to optimise the radio architecture and the 
technology portfolio for the lowest total cost and still meet the specified performance. 
The production volumes of cellular phones have reached such high figures that in some 
technology areas, e.g. analog and RF ICs, the cellular phones have become the driver 
application. As a result, large terminal manufacturers can have a significant influence 
on the price and the development of the technology. The cost breakdown of the typ- 
ical RF section is shown in Fig. 11.6. The cost implication of the direct-conversion 
approach versus the super-heterodyne is also shown. 

The duration of the design cycle is a major contributor to the total cost of the ter- 
minal's RF section. A key goal, in particular in RF IC design with its long turnaround 
time, must be the first-time success. In this respect, the quality of the design envi- 
ronment and the device models play a critical role. The RF design environment must 
be highly integrated to offer seamless system- and transistor-level simulation capabil- 
ities, access to electro-magnetic simulation tools, and capability to extract parasitics 
and substrate couplings from the layout. With the increasing packaging densities, the 
electro-dieimal simulation becomes a necessity as well. 

11.6 CONCLUSIONS 

The overall goal, namely to be able to design the world's best miniature multi-mode 
wide-band terminals with the ultimately lowest cost, is so challenging that it requires 
RF designers to do miracles in the coming years. New technologies introduce new 
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Figure 11.5 Radio front-end architectures: (a) super-heterodyne, (b) direct- 

conversion, (c) all-digital receiver. 



solutions but, at the same time, also new problems. None of the implementation tech- 
nologies will, however, offer a clear winning RF solution. Only by improving all 
factors — technologies, architectures, and design environment — the optimum lead- 
ing edge RF section can be developed. For innovative RF development, the designers 
must master larger systems and there is a clear need for multi-discipline experts, e.g., 
in RF, DSP, and materials areas. With hard work, good vision, and committed design 
teams we will see that these mir acles will take place — as so many other technological 
breakthroughs have in the past. 
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Figure 11.6 Relative cost breakdown in the typical RF section: 2000S is an evolu- 
tionary prediction for super-heterodyne transceiver and 2000D for direct conversion, 
respectively. The scale is normalised to be 100 in 1998. 
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12.1 INTRODUCTION TO THE FOLLOWING PAPERS 

Radio-frequency (RF) circuit design used to be realized using discrete bipolar devices, 
and later integrated bipolar technology was used as well. However, because of the 
dramatic growth of mobile communications and the need for providing highly minia- 
turised, cheaper equipment, there has been a recent surge of interest in realizing RF 
circuits in CMOS technology. 

One of the challenges in RF design concerns the design of the front-end receiver 
section, whose main role is to interface the radio frequency bands allocated for com- 
munication with the baseband unit, which performs the bulk of the information pro- 
cessing functions needed in modem radio communications. Conceptually, such an RF 
front-end receiver can be viewed as a mixed-signal processing and conversion chain 
which combines a tunable bandpass filtering function, an analog-to-digital converter 
(ADC), and a digital signal processor (DSP), as schematically illustrated in Fig. 12.1. 

Whilst previous sections of this book have been concerned with the practical real- 
ization of circuit and system solutions that provide the equivalent functionality of the 
tunable bandpass filter, this section is concerned with the design and integrated-circuit 
implementation of the analog-to-digital conversion function. At a later stage, the de- 
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Figure 12.1 Simple conceptual representation of an RF receiver. 



sign and implementation of dedicated DSP units for communication applications will 
also be considered. 

Most of the work and design solutions reported to date have addressed the prob- 
lem of narrow-band receivers, whereby only one channel is selected for baseband 
processing from the available multi-channel RF band [1], Therefore, as indicated in 
Table 12.1, the effective channel bandwidth to be digitised by the ADC has varied 
typically from about 25 kHz for narrow-band frequency modulated receivers to ap- 
proximately 1250 kHz for receivers based on spread-spectrum processing techniques. 
Neither represents a particularly stringent requirement for the design of integrated 
ADCs [2-6]. 



Table 12.1 Typical radio bandwidths and channel spacings for popular mobile radio 
systems. 



Standard 


Radio Bandwidth [MHz] 


Channel Spacing [kHz] 


PDC 




25 


GSM 


25 


200 


DCS- 1800 


75 


200 


DECT 


20 


1728 


IS-95 (CDMA) 


25 


1250 


AMPS 


25 


30 



The present conservative situation, however, is changing very rapidly, due to the 
insatiable demand for wider-band mobile communication systems capable of support- 
ing high-speed data transmission as well as multimedia applications. Not too far in the 
future, the ADCs in radio receiver front-ends are expected to digitise effective signal 
bandwidths from a few MHz for hand-held terminals of the universal mobile telecom- 
munication system (UMTS) to as much as 20-30 MHz in base stations implementing 
the software radio concept. High dynamic range, low spurious components, low-volt- 
age operation, and low power dissipation are additional requirements which, besides 
wide bandwidth, significantly add to the design complexity of such ADCs. 

This part of the book addresses the problem of designing high-performance, wide- 
bandwidth integrated ADCs suitable for future generations of mobile communication 
systems. First, Roovers introduces the performance metrics that are needed to eval- 
uate the feasibility of receiver architectures with wide-band ADCs and compare the 
relative performance of alternative forms of realization. It is shown that when the per- 
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formance of an ADC is mapped onto the requirements of a receiver architecture with 
multi-channel processing, it should be able to provide not only wide bandwidth but 
also high dynamic range with respect to noise limitations, distortion, and spurious sig- 
nals. Since the combination of wide bandwidth and stringent dynamic range demands 
high power dissipation, such ADCs are not yet adequate for deployment in hand-held 
terminals, where energy consumption is under tight control. Nevertheless, when such 
ADCs are used in combination with some form of digitally-controlled signal amplifi- 
cation, they can already find useful applications in base stations, as it is illustrated in 
a case-study developed for a GSM receiver. It is expected that the continuing research 
of innovative circuit techniques combined with the availability of higher-performance 
technologies, including the emerging SiGe, will make such converters also widely 
used in portable equipment in the not too distant future. 

Bang-Sup Song examines the current status of high-performance ADCs and dis- 
cusses the fundamental techniques, namely pipeline-based and folding architectures, 
which are expected to enable the realization of at least Mbit conversion linearity over 
an effective signal bandwidth higher than 25 MHz. The performance of pipeline ADCs 
is fundamentally limited by the speed and linearity of the interstage residue amplifiers, 
while the offset of the folding amplifier and overall circuit complexity appear to be the 
major limitations of folding ADCs in achieving high-performance operation. For both 
architectures, several improvements are discussed in order to increase effectively both 
the number of conversion bits and the conversion speed while dissipating significantly 
lower power than current solutions. Predictions based on recent work and trends indi- 
cate that high-performance pipeline ADCs will be capable of achieving 15 bit to 16 bit 
linearity with a spurious-free dynamic range over 90 dB at a 100 MHz sampling rate, 
and which will, therefore, make the widespread use of software radios feasible. 

In the last contribution to this part of the book, Azeredo-Leme et al. begin by exam- 
ining the contributions and requirements of different building blocks in a traditional 
receiver front-end architecture, with the aim of allowing a more relaxed realization of 
filters and synthesisers at the expense of a more complex ADC. For this purpose, they 
introduce the concept of an image-rejection sub-sampling ADC, which is a combina- 
tion of a highly oversampled ADC together with a sub-sampling block implementing 
a double-quadrature mixer. It is shown that, for present mobile communication stan- 
dards, this allows the digitisation of multiple radio frequency carriers and hence en- 
abling the final channel tuning operation to be carried out fully in the digital domain. 
An additional advantage offered by this approach is the digital control over selectivity 
and tuning resolution, which lead to simplified implementations of multi-mode termi- 
nals. It is further shown that no penalty to power dissipation arises when compared 
with classical implementations. 
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13.1 INTRODUCTION 

Mobile communications have enjoyed a very fast growth and acceptance virtually 
across all continents. However, today we observe a multitude of standards with lit- 
tle or no compatibility with each other. They differ both in the allocated bands and 
in the modes of operation (FDMA, TDMA, CDMA to mention only the multiple ac- 
cess techniques) as shown in Tab. 13.1. This has created the need for multi-band and 
multi-mode terminals. 

However, the classical super-heterodyne architecture, with the large number of ac- 
curate frequency selective filters it requires, makes it difficult to build such a multi- 
standard radio in a way compatible with today’s demand for very high level of integra- 
tion and long stand-by times. 

This led to the development of the concept of “Software Radio” where the func- 
tions of channel selection and filtering would be done digitally by software. For ex- 
ample, the telephone would have a memory module with the available standards at the 
purchase date, and updates to new standards could be loaded later from a computer 
connection. 
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Table 13.1 Standards parameters for popular mobile radio systems. 



New 

Standards 


Centre 

Band 


Channel 

Spacing 


Multiple 

Access 


Modulation 


PDC 


810-826 MHz (Rx) 
940-956 MHz (Tx) 


25 kHz 


TDMA 


7t/4 DQPSK 


GSM 


935-960 MHz (Rx) 
890-915 MHz (Tx) 


200 kHz 


TDMA 


GMSK 


DCS- 1800 


1805- 1880 MHz (Rx) 
1710-1785 MHz (Tx) 


200 kHz 


TDMA 


GMSK 


DECT 


1880-1900 MHz 


1728 kHz 


TDMA 


GMSK 


IS-95 


869-894 MHz (Rx) 
824-849 MHz (Tx) 


1250 kHz 


CDMA 


QPSK 


AMPS 


869-894 MHz (Rx) 
824-849 MHz (Tx) 


30 kHz 


FDMA 


FM 




Figure 13.1 Unrealizable software radio concept where the radio is totally software 
adaptable. 



Consider the receiver architecture. A broad-band LNA followed by an ADC digitise 
the received antenna signal, and all processing would continue in a DSP, as shown in 
Fig. 13.1. 

Obviously, this solution is completely unrealizable. Even if the DSP processing 
power were to ever become available, the specifications on the ADC would be pro- 
hibitive. The sampling rate would have to be of the order of 1 GHz in order to avoid 
tunable filters, and the dynamic range would have to be well in excess of 100 dB since 
no blocking filter is present. 

However, the search for a receiver architecture requiring as few high-frequency 
filters as possible is very active at present. 

Practical implementations of receivers basically rely on the sequencing of down- 
converting mixers and IF filters until the desired signal is at baseband. That is the case 
with the classical super-heterodyne architecture, see Fig. 13.2. The zero-IF receiver 
(which is also classical) eliminates the IF but also places all channel selection and fil- 
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(Fine resolution) (Fixed frequency) 




(Broadband) (Narrowband) Amplitude & Time 

Discretization 

Figure 13.2 Classical super-heterodyne architecture. Tuning is performed by the RF 
synthesiser LOI while the IF synthesiser is fixed. Channel filtering is performed at IF 
and in the baseband. The ADC requirements are very relaxed. 



tering in front of the ADC. More sophisticated architectures, like low-IF and quasi-IF, 
relax the requirements placed on the IF filters by careful use of image rejection mixers, 
but still place the ADC after channel selection and filtering. 

The result is that the ADC is the component with the most relaxed specifications in 
the receiver, while the filters need to accommodate very large dynamic ranges and are 
of high order (and high Q in super-heterodyne architectures). 

On the other hand, the need to perform the channel selection requires that the fre- 
quency synthesisers have a very fine frequency resolution equivalent to the channel 
spacing. Depending on the standard, the filters’ dynamic range can be above 100 dB, 
and the synthesiser’s frequency resolution can be as low as 30 kHz. The question we 
want to answer here is this: “Is it possible to relax the requirements placed on the 
filters and frequency synthesisers at the expense of a more complex ADC, but with no 
penalty on power dissipation or monolithic compatibility?” 

13.2 OVERSAMPLED ARCHITECTURES 

It is better to place the ADC before the channel filters, as shown in Fig. 13.3. The 
ADC must then be highly oversampled in order not to alias high frequency interferers 
into the baseband. 



LOI L02 




Amplitude & Time 
Discretization 



Figure 13.3 Super-heterodyne architecture with oversampling. 
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Irrespective of the receiver architecture, the baseband signal produced by the down- 
conversion mixer contains the channel signal at a weak level in addition to out-of-band 
interferers that can be much stronger that the desired channel itself. Depending on the 
IF filters’ selectivity and on the standard, these interferers can be as high as 70 dB 
above the desired signal. Taking into account fast-fading effects and AGC margins, 
the ADC will have to support a large dynamic range of 60 to 90 dB. 

The most convenient type of ADC to place at this point is a delta-sigma modulator 
(two are required, since the signal is in I and Q representation). Strong adjacent chan- 
nels are combined with quantisation noise and actually act as a dither source and help 
to randomise the bit stream. The digital decimating filter will then serve the dual func- 
tion of attenuating both the quantisation noise and the adjacent channels interference. 

In practice, the order of the baseband filter and the ADC has been interchanged. 
The filter is now implemented digitally, reusing existing hardware — the decimating 
filter. Only a simple anti-aliasing filter remains before the ADC, and the ADC has 
been highly oversampled to avoid aliasing by unfiltered high-frequency interferers 
and noise. As a result, the dynamic range, which was supported by the filter in the 
classical architecture, is now supported by the ADC. 

The advantages of this architecture are numerous. The I/Q matching accuracy is 
very good, depending basically on the matching of the two ADC input stages. The 
portability between different technologies is facilitated since more of the signal pro- 
cessing is done digitally. The filters can have a very accurate frequency response and 
a linear phase characteristic important for digital modulation techniques. Besides, the 
channel bandwidth can be easily programmed. 

But the ADC is required to operate at a very high sampling rate and must achieve a 
large dynamic range. We now compare the power dissipation of this ADC and that of 
the original filter it replaced. 

13.3 POWER DISSIPATION ISSUES 

The power dissipation of an analog circuit depends on many considerations, most of 
them related to the limitations of the technology being used. But it is instructive to 
consider the fundamental limitations of analog signal processing. 

The basic function of an analog circuit is to process a voltage (or current) signal, 
identified by its bandwidth B and amplitude, with a maximum degradation charac- 
terised by distortion and noise. Regarding distortion, there are no relevant physical 
limits determining a minimum power dissipation level. But in the case of noise, there 
is a clear relationship involving temperature, signal bandwidth and circuit dynamic 
range. For the case of voltage signals and optimal supply voltage corresponding to the 
signal amplitude, it is given by the following expression [1]: 



P diss = UTBDR, (13.1) 

where k is the Boltzmann constant and T is the absolute temperature. The dynamic 
range, DR, is defined as the power ratio between the maximum signal amplitude (can 
be an out of band interfered to the thermal noise integrated over the signal band- 
width B. The power associated to (13.1) is dissipated by the necessary charging and 
discharging of the capacitance on each circuit node carrying the signal. In a filter there 
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are, as a minimum, N such nodes, where N is the filter order. In an ADC, only one 
such node is necessary as a minimum: the sampling node. 

This analysis is valid irrespective of continuous-time or discrete-time circuits. The 
power dissipation is not affected by the actual sampling frequency of the ADC for a 
fixed signal bandwidth and dynamic range. This leads to two conclusions: 

1. The power dissipation of the ADC is not necessarily higher than that of the analog 
filter it replaced. 

2. The ADC can be oversampled by a large factor without penalty on the power dissi- 
pation. 

This second conclusion is important because it allows to use a sufficiently large 
sampling frequency so that the ADC images will be far enough from the desired chan- 
nel for the anti-aliasing filters to be of simple construction. It means that for a fixed 
signal bandwidth (the channel bandwidth) and fixed dynamic range, the sampling fre- 
quency can be increased without necessarily increasing the power consumption. 

There are practical limits to this, however. In switched-capacitor (SC) circuits, as 
the sampling frequency increases, the sampling capacitors can be decreased by the 
same factor, to keep the thermal noise k ■ ( T /C) constant within the signal band. The 
reduced load capacitors allow to keep the same amplifier slew rate at the original bias 
currents, but the amplifier needs to have a higher gain-bandwidth product. That, in 
turn, cannot be made higher than the internal non-dominant poles of the amplifier, 
otherwise the phase margin will be reduced to unacceptable values. 

Considering CMOS technology, the most power-efficient operating point of the 
transconductance defining MOS transistors in an amplifier is in moderate inversion 
with an over-drive voltage Vov = Vgs — Vy 50mV. At this point the ratio Gm/Iu 
is maximum. However, the transit frequency, Fy, of the transistors in the signal path 
must be above the sampling frequency by a factor of typically 20, at least, to preserve 
phase margin. 

In CMOS, Fy of the transistors is a function of the operating point: 



Fy = 



M ■ Vov 
L2 



(13.2) 



An NMOS transistor in a 0.35 tun technology has an Fy of about 2 GHz, allow- 
ing sampling frequencies approaching 100 MHz at optimum power dissipation, in the 
sense that the transistors responsible for poles are biased at the maximum Gm/Iq 
operating point. In fact, there are already several reports of delta-sigma modulators 
validated for sampling frequencies in excess of 50 MHz [2^1]. 

Above that frequency, the over-drive voltage of the transistors will have to be in- 
creased at least proportionally to the sampling frequency in order to keep their transit 
frequency high enough. Since the MOS transconductance is of the form 

G M = 2—, (13.3) 



the bias current will have to be increased as well in order to keep the transconductance 
values fixed. This makes the power consumption increase linearly with the sampling 
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frequency, until the over-drive voltage is large enough for the transistors to enter the 
velocity saturation region. Then the power dissipation will increase very fast with 
increases in sampling frequency. This is illustrated in Fig. 13.4. 




Figure 13.4 Typical power dissipation of a CMOS amplifier for constant ( F s ■ Cl)- 



13.4 IF SAMPLING ARCHITECTURES 

IF sampling architectures directly interface the ADC to a signal band centred at IF. 
This leads to a convenient system partitioning where the IF is the interface between 
the baseband signal processing chip and the RF front-end. 

The analysis in the previous section showed that it is possible to efficiently increase 
the sampling frequency of an SC circuit up to a frequency of 100 MHz. However, the 
IF frequencies are often above this frequency, requiring more advanced architectures 
to allow direct sampling of the IF signal. 

The solution is to operate in a sub-sampling mode. In this mode, the amplifiers 
operate at a reduced rate, and only the input SC branches need to settle accurately 
with IF input signals. Since these are simple passive branches, their operating speed 
can be much higher than that of the active charge transfers performed by the amplifiers, 
and can easily allow sampling signals well above 100 MHz. 

There are several possible approaches to integrate the complete IF-sampling re- 
ceiver in CMOS. 

One is to keep exactly the architecture of Fig. 13.3. The mixer could either be imple- 
mented as a classical Gilbert cell, using triode-region-biased transistors [6], or using a 
sub-sampling SC circuit. Double-quadrature architectures can also be considered for 
improved image rejection in the RF front-end providing I and Q IF. 

13.5 IMAGE-REJECTION SUB-SAMPLING ADC 

A more ambitious approach is to embed the sub-sampling SC mixer in the ADC, as 
shown in Fig. 13.5. The anti-aliasing filter is eliminated and its function is performed 
by the IF filters. This approach results in a significant reduction of receiver complexity 
and a potential power dissipation reduction since fewer blocks are required. 

The IF filters can be strongly relaxed since they only need to attenuate the aliasing 
bands of the ADC. The sub-sampling SC mixer can be implemented by appropriate 
phasing of the input SC branches of the first integrator of the delta-sigma modulator. 
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Figure 13.5 Receiver architecture using an image-rejection sub-sampling ADC. 




Figure 13.6 Block diagram of IF sampling block using two IRSSADCs. 



We denote the component combining the functions of sub-sampling mixing, image 
rejection (due to the double-quadrature topology) and analog-to-digital conversion as 
Image-Rejection Sub-Sampling ADC (IRSSADC). The application of IRSSADC for 
the realization of the IF sampling block is shown in Fig. 13.6, where L02/ and L02 q 
are the in-phase and quadrature signals of the IF local oscillator. Two IRSSADCs are 
required, realizing a double-quadrature down-conversion from an I and Q IF. 

The mixing and addition operations are realized directly at the input SC branches 
of the ADC. These are exemplified in Fig. 13.7 where the output is connected to the 
virtual ground of the first delta-sigma integrator. In a practical realization, the circuit 
would be fully differential and would use parasitic-insensitive SC branches. 

The clock phases in Fig. 13.7 realize a LO frequency at half the clock rate of the 
delta-sigma modulator, F s . Since the switches implement impulse sampling, all odd 
harmonics of the LO exist, thereby allowing sub-sampling operation. 

Since the LO is a complex-domain circuit, the LO frequency should actually be 
analysed also in the complex domain, where the frequency axis is not symmetrical. 
Assuming a model for the mixer in the form of a multiplier, the time waveforms repre- 
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Figure 13.7 Input SC branches with corresponding clock phases for implementing 
the IRSSADC. 
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Figure 13.8 Time domain equivalent waveforms representing the I and Q LO. 



senting the I and Q LO signal are as shown in Fig. 13.8. The arrows denote the Dirac 
function corresponding to ideal impulse sampling. 

Applying the Fourier transform, we obtain the frequency representation shown in 
Fig. 13.9. As expected, the spectrum is not symmetrical. Since the IF signal is also in 
a complex representation, the relevant band is located in the positive frequency axis. 
The frequency band in the negative frequency axis corresponds to the image band. 

The operation of this mixer is now clear: Consider the bands around F s /2, 5F s /2, 
9F s /2, ... Since the local oscillator frequency, L02, is F s /2, these bands correspond 
to the fundamental, 5 th harmonic, 9 th harmonic, etc., of L02. Therefore, a sub-sam- 
pling operation by a factor of 1, 5, 9, . . . , is possible. If we eliminate the inversion in 
Fig. 13.6, it can be similarly shown that the sub-sampling factors become 3 7, 1 1 ,... 

The presence of multiple input bands is typical of sampled systems and the desired 
band should be selected by an anti-aliasing filter in front of the mixer. This filter can 
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Figure 13.9 Frequency domain equivalent spectrum representing the I and Q LO. 



be the IF filter itself and should reject the bands ±4 times L02 away from the centre 
IF band. 

13.5.1 Case study 

As a practical example, for an IF centre frequency of 150 MHz, and when operat- 
ing with a sub-sampling factor of 5, the local oscillator frequency is 30 MHz and the 
delta-sigma modulator sampling frequency is 60 MHz. The IF filter should reject the 
aliasing bands around 30 MHz and 270 MHz. 

Assuming a typical application such as GSM, where the baseband channel band- 
width is 100 kHz, the delta-sigma modulator would be operating with an oversampling 
ratio of 60000/200 = 300. This would easily allow to achieve a dynamic range in ex- 
cess of 90 dB [4], This dynamic range is higher than the ratio of maximum blocking 
levels (within GSM band) to the minimum signal level, which is about 80 dB. 

The previous example confirms that the IF filters can be substantially relaxed since 
their function is basically to reject aliasing bands. 
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13.6 DIGITAL TUNING 

As already mentioned, the dynamic range of the ADC is increased by proper sizing 
of the sampling capacitance, which, for a large oversampling frequency, can be very 
small. This concerns the level of thermal noise. In delta-sigma modulators one must 
also consider the quantisation noise. In these circuits, the signal-to-quantisation-noise 
ratio (SQR) is a strong function of the oversampling ratio. By proper use of techniques 
such as 

■ high order modulator loops, 

■ multi-bit quantising, 

■ data-weighted averaging linearisation, 

an SQR above 90 dB can be easily achieved at a low oversampling ratio such as 16. 
This means that in the present application where the oversampling ratio is of the or- 
der of 100, the quantisation noise is negligible compared to thermal noise within the 
channel bandwidth. 



LOI L02 L03 




Figure 13.10 Receiver architecture using an image-rejection sub-sampling ADC and 
digital tuning. 



This motivates a more ambitious application of the IRSSADC. The idea is to use 
broadband digitalisation where a group of channels is down-converted to a very low 
IF. The channel selection is then performed by a third mixer, as shown in Fig. 13.10. 
At this point, signal processing is already digital, which allows much flexibility in 
defining channel spacing and channel filtering. 

For example, in the case study above, the IRSSADC, at an oversampling ratio of 
16, can digitise a band of 60 MHz / (2x16) = 1875 kHz. This allows to include 9 GSM 
channels at once (the channel spacing is 200kHz). 

The high flexibility due to easy digital programmability of channel centre frequency 
and spacing is very advantageous for implementing a multi-mode/multi-band terminal. 

Furthermore, the RF frequency synthesiser can now have a coarse frequency reso- 
lution, which significantly relaxes its design specifications. 

13.7 ARCHITECTURE SIMULATION 

A Matlab/Simulink model has been developed to represent the architecture shown in 
Fig. 13.11 consisting of an IF sub-sampling quadrature mixer, I and Q low-pass filters, 

1 and Q Delta-Sigma modulators, digital decimating filters and digital tuning. The IF 
filter is not included. 
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The Delta-Sigma modulators are of third order with 3-bit quantisation. They use dy- 
namic data averaging linearisation of feedback multi-bit DACs. The GSM case-study 
presented above is used here again. 




Figure 13.1 1 Architecture considered for simulation. 

The sub-sampling mixer is a switched-capacitor circuit that samples the signal at 
an F s of 60 MHz and mixes it with two sampled sine waves in quadrature with a fre- 
quency of F s /2 = 30MHz. This mixer generates I and Q sampled signals, which are 
filtered by two anti-aliasing filters. The first one is a first-order switched-capacitor low- 
pass filter (actually it is embedded in the mixer), and the second is a continuous-time 
second-order Butterworth low-pass filter. The signal is then converted to the digital 
domain, using a third-order delta-sigma converter sampling at F s and decimated with a 
sine 4 filter into 60/8 = 7.5 MHz. Afterwards, the signal is mixed digitally with a DDS- 
generated sine wave, converting the desired channel to base-band. Finally the signal 
is low-pass filtered and decimated by 16 to obtain the desired I and Q components of 
the signal. 

The signals applied are: 

■ Desired signal at 150.8 MHz with an amplitude 100 pV (0 dBref) 

■ Interferer 1 at 150.6 MHz (1 channel offset), amplitude 300 pV(+9.5 dBref) 

■ Interferer 2 at 150.4 MHz (2 channels offset), amplitude 12mV (+41 dBref) 

■ Interferer 3 at 149.2 MHz (8 channels offset), amplitude 12mV (+41 dBref) 

■ Interferer 4 at 147.8 MHz (15 channels offset), amplitude 630mV (+76 dBref) 

■ Interferer 5 at 89.25 MHz (near 3/5 of IF), amplitude 25 p.V (-12 dBref) 

■ Interferer 6 at 119.8 MHz (near 4/5 of IF), amplitude lOOmV (+60 dBref) 

The spectrum at the input of the Delta-Sigma modulator is shown in Fig. 13.12a, 
where the signals around IF were shifted to DC by the sub-sampling mixer. The in- 
terferer 16 is down-converted but strongly attenuated by the anti-aliasing filters (not 
shown). The interferer 5 is down-converted without attenuation, because it lies around 
the third harmonic of the IF local oscillator (90 MHz). If a complex IF were used, as 
in Fig. 13.5, this image would be strongly attenuated. Fig. 13.12b shows the spectrum 
at the output of the delta-sigma modulator, together with the transfer functions of the 
anti-aliasing filters (curve 1) and of the sine r decimating filter (curve 2). The signal 
dynamic range is still 76 dB (ratio between the strongest interferer, 14, and the desired 
signal). 













Figure 13.12 ADC AS converter input (a), ADC (AS) converter output (b). 
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Figure 13.13 Decimator filter output (a), Digital mixer output (b). 




Fig. 13.13a shows the resulting spectrum at the output of the decimating filter, 
before the digital mixer, together with the transfer function of the anti-aliasing filter 
plus sine. The Delta-Sigma converter quantisation noise is well below the desired 
signal. 

Fig. 13.13b represents the digital mixer output. Curve 1 corresponds to the com- 
bined transfer function of the anti-aliasing filter plus sine 4 . Now the desired signal 
is centred at DC and all interferes are in the rejection band of the digital low-pass 
decimation filters (not shown for clarity). 

In Fig. 13.14, the spectrum at the output of the system is shown with the interferers 
strongly attenuated and the desired signal well above the noise level. The last digital 
low-pass filter transfer function is also represented. The image of interferer 5 can also 
be seen, generated in the sub-sampling mixer, and the image of interferer 2 generated 
in the last decimation by two in the digital filter. Since this fdter has a stop-band 
attenuation of 65 dB and the interferer is 41 dB above the desired signal, the image is 
attenuated to 23 dB below the desired signal. 



WIDE-BAND SUB-SAMPLING A/D CONVERSION WITH IMAGE REJECTION 



185 




- 0.2 - 0.1 0 0.1 0.2 
Figure 1 3. 1 4 System output. 



13.8 CONCLUSIONS 

We introduced the concept of the image-rejection sub-sampling ADC, which is a com- 
bination of a highly oversampled ADC associated with a sub-sampling block imple- 
menting a double-quadrature mixer. 

This allows to implement wide-band conversion for radio receivers with associated 
advantages. On the one hand, software tuning and selectivity becomes possible, simpli- 
fying implementation of multi-mode capability. On the other hand, the specifications 
of the IF filters and of the RF frequency synthesiser can be relaxed, the former due 
to the large dynamic range of the ADC that accepts a high level of blocking signals, 
the latter due to the software tuning that allows coarse frequency resolution in the RF 
frequency synthesiser. 

An analysis was presented that shows that these advantages should be possible with 
no penalty on power dissipation, and simulation results demonstrate the efficacy of the 
principle. 
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14.1 INTRODUCTION 

In order to realise a multi-channel receiver system, a wide -band A/D converter must 
be available which can convert a complete band into the digital domain. The basic 
schematic of a multi-channel receiver is shown in Fig. 14.1. 




LO 



Figure 14.1 Multi-channel receiver schematic. 
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The multi-channel receiver has an antenna input (RF) followed by a band filter and 
a low noise amplifier (LNA). The RF signal is down-converted to an IF frequency and 
again some filtering can be applied. The IF signal containing the complete signal band 
is then converted to the digital domain by a wide-band A/D converter. The channel 
selectivity is implemented in the digital domain. The advantages of such a system 
are obvious: increased flexibility and reduced analog hardware when more than one 
channel is to be received simultaneously. These advantages of multi-channel receiver 
are particularly interesting for cellular base stations, where multiple channels are re- 
ceived. The signal at the input of the A/D converter combines a wide bandwidth with 
a large dynamic range. This large dynamic range is due to the fact that neither channel 
filtering nor variable gain can be used before the A/D conversion in a multi-channel 
receiver. This means that the converter has to handle at the same time maximum 
blocking and interfering signal levels while still receiving channels with signals at 
minimum power levels. This puts severe performance requirements on the linearity 
and noise contributions of the A/D converter, 

14.2 PERFORMANCE METRICS FOR A/D CONVERTERS 

There are two reasons to introduce performance metrics for A/D converters: 

■ to evaluate the feasibility of receiver architecture with a wide-band A/D converter, 

■ to compare different A/D converter realisations (benchmarking). 

The performance metrics used here are based on the effective performance in terms 
of dynamic range, bandwidth, and power dissipation. These three parameters are com- 
bined into a global figure of merit. It should be clear to the reader that other parameters 
such as latency, power supply voltage, input range, input impedance, etc. are important 
as well, but these are not included in the figure of merit. First the resolution bandwidth 
product of an A/D converter is introduced. The A/D converter can be regarded as a 
black box: analog in, digital out as shown in Fig. 14.2. 

Assuming that a full-scale, single-sine-wave input signal is applied to the A/D con- 
verter, the digital output can be analysed using its Fast Fourier Transform (FFT) spec- 
trum. This FFT spectrum contains the input signal together with noise and harmonic 
components. From this FFT spectrum the SINAD (Signal to Noise And Distortion) 
is calculated. This SINAD can be translated in ENOB (Effective Number Of Bits), 
describing the effective resolution of the A/D converter: 



ENOB = ( SINAD - 1.76dB)/6.02 (14.1) 

This ENOB is measured in function of the input signal frequency (c.f. Fig. 14.2). 
The closer the ENOB approaches the nominal number of bits the better. The useful 
bandwidth of the A/D converter can be limited in two ways: First, the Ny quist criterion 
gives a maximum bandwidth to be converted equal to half the sampling frequency 
(f s /2). The second limitation is due to the fact that A/D converters tend to have a 
reduced ENOB for high-frequency input signals. Therefore the effective resolution 
bandwidth is defined as the minimum of the Nyquist frequency (/,/ 2)and the input 
signal frequency for which the ENOB is reduced by 0.5 bit (/in,enob-0,5) compared 
to the ENOB at low input frequencies ( ENOBq ). The effective resolution bandwidth 
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Figure 14.2 A/D converter: Black-box representation and ENOB in function of f\ n . 



product ( BWres ) combines both the effective resolution and the effective resolution 
bandwidth: 



BWres = 2 ENOB ° • min (/, /2, / in , eno b-0,5) (14.2) 

This product is proportional to the area in Fig. 14.2, where the difference to the nom- 
inal resolution-bandwidth becomes clear. The effective resolution-bandwidth products 
for some A/D converters are plotted in Fig. 14.3. 

If the effective resolution bandwidth product is combined with the power dissipa- 
tion of the A/D converter a Figure of Merit ( F.o.M .) can be defined: 



F.o.M. = power/2 ■ BWres 



(14.3) 
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Figure 14.3 BWres and ENOBo for some A/D converters. This data is derived from 
manufacturers data sheets [2]. 
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The factor of 2 is introduced to be compatible with the existing F.o.M. based on 
nominal performance, i.e. /^o.A/ n0 minal = power/(2”/ f ). The dimension of the F.o.M 
is Joule per conversion, and for better performance its value should be as small as 
possible. In Fig. 14.4 the evolution of this F.o.M. as a function of time is plotted. 

The improvement in A/D converter performance over time is both due to design 
innovations and technology improvements. At this moment, wide-band A/D convert- 
ers (10... 14 bit) that achieve a F.o.M. of 5 pJ per conversion are state of the ail. The 
F.o.M. can be used to estimate the A/D converter’s power dissipation for various appli- 
cations in a resolution-bandwidth graph as shown in Fig. 14.5. The lines of constant 
power dissipation are based on a F.o.M. of 5 pJ per conversion. 

The wide-band A/D converters to be used in base-station receivers require band- 
widths in the order of tens of MHz and resolutions in the order of 12 to 16 bit, de- 
pending on the IF frequency used and the amount of filtering in the analog part of 
the receiver. These A/D converters have a power consumption in the order of 1 Watt 
and are therefore not suitable for implementation in handsets. For base station applica- 
tion, these levels of power dissipation are acceptable, and the benefits of receivers with 
wide-band A/D converters can be exploited as multiple channels share the same analog 
front-end plus A/D converter. The realisation of this A/D converter for multi-channel 
receivers is the main challenge in A/D converter design. 

14.3 RECEIVER ARCHITECTURE AND ADC SPECIFICATION 

The resolution and bandwidth required for a multichannel receiver is explained with 
GSM as an example. The GSM signal spectrum is shown in Fig. 14.6. 
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Figure 14.4 F.o.M. evolution in function of time [2-6]. 
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Figure 14.5 ADC power dissipation in resolution-bandwidth graph based on 5 pJ per 
conversion. 



The 25 MHz signal band is divided into 200kHz-wide channels. The base station 
has to perform according to conformance test for blocking, interferes and sensitivity. 
The conversion to the digital domain of a complete band requires a dynamic range in 
the order of 100 dB and a bandwidth larger than 25 MHz. If only a single channel is to 
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Figure 14.6 GSM signal band. 



be converted, channel filtering and variable gain can be introduced in the analog signal 
path to reduce the dynamic range requirements in the A/D converter [7]. 




single- 
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Figure 14.7 Receiver architecture with wide-band A/D converter. 



Two architectures containing wide-band A/D converters are shown in Fig. 14.7. 
The difference between them is the presence of a (single) channel filter in the first 
one. For this single channel receiver, a variable-gain amplifier can be introduced to 
reduce the required dynamic range of the A/D converter. The second architecture 
is a multi-channel architecture that does not allow variable gain in front of the A/D 
converter to reduce the A/D converter requirements. 

When the performance of the ADC is mapped to the system requirements, two 
kinds of Dynamic Range (DR) should be distinguished: DR due to noise limitation 
(SNR) and DR due to distortion and spurious signals (SFDR). The theoretical perfor- 
mance of an A/D converter results in an SNR improvement of 6 dB per additional bit 
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and an SFDR improvement of 9 dB/bit (3 rd harmonic in ideal quantisation). For the 
SNR specification we have to take the “processing gain” into account: as the channel 
bandwidth is smaller than the Nyquist bandwidth, the noise outside the wanted channel 
will be filtered out, resulting in an SNR improvement of 101og(BWNy qu ist/BW C hannei)- 
In Fig. 14.8, the SNR and SFDR of a 12bit A/D converter is plotted as a function of 
the input signal level related to the full scale of the A/D converter (dBFS). 




Figure 14.8 SNR and SFDR as a function of signal level. 



A real 12-bit converter typically achieves an SFDR of 80 dB (theoretically 108 dB 
SFDR), which is not sufficient for a multi-channel base station receiver. This requires 
at least a highly linear 14-bit wide-band A/D converter to achieve an SFDR of about 
100 dB. 

14.4 CASE STUDY: SINGLE-CHANNEL RECEIVER BASED ON A 
12-BIT WIDE-BAND A/D CONVERTER 

The difference between real and ideal A/D converters is explained by a 12-bit A/D 
converter realisation [1], This BiCMOS A/D converter is based on a cascaded fold- 
ing-interpolating architecture resulting in a power/area efficient implementation. The 
folding-interpolating quantiser is preceded by a wide-band Track-and-FIold to obtain 
a resolution bandwidth equal to f s /2. The limiting factors in ENOB are contributed 
by various sources of noise and distortion: nonlinearity in the Track-and-FIold, INL er- 
rors due to mismatching in the quantiser, thermal noise, clock buffer noise, etc. These 
sources of noise and nonlinearity are often proportional (or quadratically proportional) 
to the power consumption. As an example, the breakdown of the contributions of noise 
and distortion for the 12 bit A/D converter realisation is shown in Fig. 14.9. 

Despite the state-of-the-art F.o.M. of 5 pJ per conversion, this 12-bit A/D converter 
has not enough dynamic range to be used in a multi-channel receiver for cellular base- 
stations. Flowever, this wide-band A/D converter can be used to realise a single -chan- 
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Figure 14.9 Noise and nonlinearity contributions to ENOB for a 12-bit A/D converter. 




Figure 14.10 Single-channel IF-to-digital conversion system. 



nel IF-to-digital conversion system. This system is shown in Fig. 14.10. The channel 
filter at IF reduces the dynamic range requirements and allows the use of a variable- 
gain amplifier in front of the ADC. In this case, the variable gain amplifier is digitally 
controlled and has a gain variable from 6 to 30 dB in 6-dB steps. The basic schematic 
of the Digitally Controlled Variable Gain (DVGA) is shown in Fig. 14.11. This addi- 
tional gain range extends the dynamic range for noise of the single channel receiver to 
about lOOdB for a 200kHz channel. 

This DVGA was designed for low noise, low distortion, and fast settling, and it re- 
places an input buffer amplifier to drive an A/D converter. The SFDR of the combined 
DVGA and ADC is shown in Fig. 14.12. 

While the SFDR of the A/D converter decreases with higher input signal levels, the 
distortion of the DVGA increases. Since a channel filter is introduced in front of the 
converter, the global SFDR of DVGA and ADC is sufficient to realise a single-channel 
IF-to-digital converter. 









Figure 14.1 1 Digital Controlled Variable Gain Amplifier (DVGA) schematic. 




Figure 14.12 SFDR for DVGA and ADC. 
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14.5 CONCLUSIONS 

A/D converters for multi-channel receiver base-stations is THE challenge in ADC de- 
sign. Based on performance metrics and observations of state-of-the-art A/D convert- 
ers, it can be concluded that the combined dynamic range and bandwidth requirements 
for multi-channel receivers will not be low power: these A/D converters will be hot. As 
ADCs for multi-channel receivers are not yet available, wide-band A/D converters can 
be used in combination with DVGA to build IF-to-digital conversion for single-chan- 
nel receivers. 
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15.1 INTRODUCTION 

In digital wireless systems, a need to quantize a block of spectrum with low intermod- 
ulation has become one of many technically challenging problems. Implementing IF 
filters digitally has already become a necessity in cell sites and base stations as ex- 
plained in Fig. 15.1. Even in hand-held units, placing data conversion blocks closer 
to the RF front end has many benefits. Substantial savings in system cost and com- 
plexity can be attained by implementing highly selective IF filters digitally. Digital 
IF filtering can increase immunity to adjacent and alternate channel interferences, and 
the RF transceiver architecture can be made independent of the system and adapted to 
different standards using software. However, all these predictions for the software ra- 
dio rely on the availability of low-spurious ADCs and DACs that consume low power 
at low supply voltages. 

Table 15.1 summarizes various RF communication standards. The fundamental 
limit in quantizing the IF spectrum is crosstalk and overload, and the system perfor- 
mance heavily depends on the SFDR (spurious-free dynamic range) of the sampling 
ADC. Spurious components result from two non-linearity sources during the quan- 
tization process. One is the differential non-linearity (DNL), and the other is the 
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Figure 15.1 Software programmable multi-standard radio transceivers. 



Table 15.1 Parameters of various RF communications. 



Applications 


IFBW 


Modulation 


Baseband 


FM mobile 


30 kHz 


FM 


4 kHz 


GSM 


200 kHz 


GMSK 


13 kb/s 


CDMA 


2 MHz 


M-PSK 


8 kb/s 


Military FH 


30 kHz 


FH-QPSK 


16 kb/s 


JTIDS 


3 MHz 


MSK 


150 kb/s 


Microwave 


20 MHz 


64 QAM 


90 Mb/s 



integral non-linearity (INL). The former produces harmonically unrelated spurious 
components in the quantized spectrum while the latter produces intermodulation com- 
ponents. A possible solution to reduce spurious components resulting from the limited 
DNL is a dithering method to spread the spurious components randomly [1], On the 
other hand, the limited INL can only be improved by straightening the ADC trans- 
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fer characteristic. In general, only high-resolution techniques using oversampling or 
calibration are known to improve INL. 

15. 1. 1 Technical challenges In digital wireless 

Fig. 15.2 explains why wide dynamic range is needed to quantize a weak signal em- 
bedded in strong interferences. To meet this growing demand, low-spurious ADCs are 
being actively developed in ever increasing numbers. Current commercial products 
for this purpose are only a few, such as Analog Devices’s AD6640 (12 b, 65 MHz) and 
National’s CLC952 (12 b, 41 MHz), but more are being announced this year. Typi- 
cal performance currently possible with bipolar or BiCMOS technologies ranges from 
12 b to 14 b sampling at 65 MHz with a typical SFDR of 70 dB to 80 dB, although 
numbers higher than 80 dB are often quoted for converters using a dithering method 
[1]- [3]. Typical power consumption of such parts is well over half a watt. For a 
14b-level dynamic range while sampling at over 50MHz, it is necessary to control 
the sampling jitter to below 0.3 ps. The low-jitter requirement is on the border line 
of the bipolar chip performance, but very challenging for scaled CMOS technologies. 
However, unlike non-linearity that causes inter-channel mixing, the random jitter in IF 
sampling increases only the random noise floor. As a result, the random jitter is not 
considered as a fundamental problem in this application. 



-1 3dBm 
Strong Signal 



-104dBm 
Weak Signal 



91 dB 

Dynamic Range 



-114dBm Noise 



Figure 15.2 Spectral density of the RF spectrum. 



Considering the current state of the art in CMOS ADCs, most architectures known 
to date are unlikely to achieve a high sampling rate of over 50 MHz even with 0.2 to 
0.3 pm technologies [4-13], but the situation looks better for bipolar or BiCMOS [1- 
3,14-16]. The ADC architecture varies depending on system requirements. 

15. 1.2 ADC state of the art 

Fig. 15.3 is a resolution-versus-speed plot showing the trend of ADC-development 
works recently reported at the ISSCC. Three architectures stand out for three important 
areas of applications. For example, the oversampling converter is exclusively used to 
achieve high resolution above 12 b level at low frequencies. The difficulty in achieving 
better than 12 b matching in conventional techniques gives a fair advantage to the 
oversampling technique. For medium speed with high resolution, pipeline ADCs are 
promising. At extremely high frequencies, only flash and folding ADCs survive, but 
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with low resolution. As both semiconductor process and design technology advance, 
the performance envelope will be pushed further. Table 15.2 lists the state of the art of 
> 12 b ADCs in different technologies. 
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Figure 15.3 ADC performance, resolution versus speed. 



The ADC performance is often represented by a figure of merit L as shown in 
Fig. 15.4. The higher L is, the more bits are obtained at higher speed with lower 
power. The plot of L versus year shows the low-power and high-speed trend both for 
integrated CMOS and bipolar/BiCMOS ADCs published at the ISSCC during the last 
decade. Overall, the low-power trend is evident though not dramatic. 

15.2 TECHNIQUES FOR HIGH-RESOLUTION ADCS 

There exist many high-speed ADC techniques for a wide range of resolution and 
speed. Table 15.3 summarizes some of them and their features. In general, the main 
criteria of choosing ADC architectures are resolution and speed, but additional re- 
quirements such as power, chip area, supply voltage, latency, operating environment, 
or technology often limit the choices. The current trend is toward low-cost integration 
without using expensive discrete technologies. Among many high-speed data conver- 
sion architectures, two high-speed candidate architectures, pipeline and folding, and 
their variations with new system approaches are potential candidates to challenge the 
limit set by the software radio. This paper is focused on these two architectures and 
presents two possible ways to improve their linearities. 

In general, ADC design examples are so diverse that it is difficult to fit all of them 
into a few fixed frames. While pipeline ADCs are limited by the DAC resolution 
and residue amplifier gain, folding ADCs suffer from poor offset of the folding am- 
plifier. Therefore, high-resolution techniques should be tailored for each architecture. 
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Table 15.2 State-of-the-art ADCs. 



Author 

Note 


Speed 

Bits 


Architecture 

Technology 


Power 
Area (mm 2 ) 


I. Opris 


20 M 


Pipeline w/ self calib. 


250 mW 5 V 


ISSCC98 


12 b 


0.7 pm CMOS 


3.2 x 3.1 


L. Singer 


10M 


Pipeline 


210 mW 5 V 


VLSI96 


14b 


0.8 pm CMOS 


19 


S. Kwak 


5M 


Pipeline w/ calibration 


60 mW 5 V 


ISSCC97 


15 b 


1.4 pm CMOS 


4.4 x 6.2 


P. Vorenkamp 


50 M 


Cascaded folding 


300 mW 5 V 


ISSCC97 


12b 


13 GHz 1pm BiCMOS 


2.5 x 3.3 


T. Shu 


10M 


Pipeline w/ calibration 


350 mW 5 V 


VLSI94 


13b 


1.4 pm BiCMOS 


6.7 x 5.0 


M. Mayes 


1M 


Pipeline w/ calibration 


200 mW 5 V/3.3 V 


ISSCC96 


16b 


1 pm BiCMOS 


5.7 x 6.1 


R. Jewett 


128 M 


Two-step w/ dithering 


5.7W5/-5.2V 


ISSCC97 


12b 


25 GHz Bipolar 


5.6 x 7.3 



Analog high-resolution techniques reported to date to improve ADC linearity are as 
follows: 

■ Dynamic matching (Plassche, JSSC, June 1979) 

■ Ratio-independent (Li, JSSC, Dec. 1984) 

■ Reference-feedforward (Sutarja, JSSC, Dec. 1988) 

■ Capacitor-error averaging (Song, JSSC, Dec. 1988) 

■ Current sampling (Schouwenaars, JSSC, Dec, 1988) 

■ Voltage sampling (Kerth, JSSC, April 1989) 

■ Inherently monotonic (Wu, ISCAS, 1994) 

The dynamic matching technique is efficient, but it generates high-frequency noise. 
The ratio-independent, reference-feedforward, and inherently monotonic techniques 
either require more than two clock cycles or are limited to improvement in DNL 
only. The capacitor-error averaging technique requires three clock cycles, and the 
voltage/current sampling techniques are sensitive to clock sampling errors. 

The ADC self-calibration concepts have originated from the direct code-mapping 
concepts using memory. The code-mapping technique is to predistort the digital input 
of the DAC so that the DAC output can match the ideal level from the calibration equip- 
ment. Due to the precision equipment needed, this method has limited use. The first 
self-calibration concept applied to the binary-ratioed successive-approximation ADC 
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Figure 15.4 Figure of merit (L) versus year. 
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Figure 15.5 Capacitor ratio error measurement. 



is to internally measure capacitor DAC ratio errors using a resistor-string calibration 
DAC as shown in Fig. 15.5 [17]. Later, an improved concept of the digital-domain cal- 
ibration was developed for the multi-step pipeline ADC [6], and the trend continues to 
calibrate ADCs in the background while ADCs are in operation [1 1,15]. 
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Table 1 5.3 High-speed ADC techniques and their features. 



Techniques 


Features 


Flash 


Fastest, but complexity increases exponentially. 


Interpolation/averaging 


Reduces the number of preamplifiers and improves 
DNL. 


Pipeline 


Fast, but limited by DAC accuracy and residue amplifier 
settling. 


Folding 


Smaller number of comparators than flash, but limited 
by speed and offset of the folders. 


Time interleaving 


Parallel multi-path system, but limited by fixed pattern 
noise and clock accuracy. 


Subranging 


Smaller number of comparators than flash and monoto- 
nic, but limited by comparator accuracy. 


Double sampling 


Have more time, but need extra interstage S/H. 


Current domain 


Fast, but no precision linear current source exists. 



Analog 

Output Ideal 




Figure 15.6 Digital code-error calibration technique. 



The general concept of the digital-domain calibration is to measure code or bit er- 
rors, to store them in the memory, and to subtract them during the normal operation. 
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• Simple binary code 

• Good INL 

• Convenient to measure ratio enors 

Analog 

Output 




Cod* 



• Complex thermometer code 

• Good DNL 
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Analog 

Output 




Figure 15.7 Comparison of binary and thermometer code-error measurements. 



The concept is explained in Fig. 15.6 using a generalized N-bit MDAC using a capac- 
itor array [6]. If the DAC code increases by 1, the MDAC output should increase by 
V re f or Vref/2 if redundancy is used for digital correction. Any deviation from this 
ideal step is defined as a segment error. The code errors are obtained by accumulating 
segment errors. This segment-error measurement needs two cycles. The first cycle 
is to measure the switch-feedthrough error, and the next cycle is to measure the seg- 
ment error. The segment error is simply measured using the LSB-side of the ADC by 
increasing the digital code by 1. In the case that N= 1, the segment error is the bit 
error. If binary biterrors are measured and stored, the code error should be calculated 
for subtraction during the normal operation. How to store DAC errors is a trade-off 
issue. Ways to measure binary and thermometer code errors are explained in Fig. 15.7. 
Examples of the digital calibration are well documented for the cases of the segment 
error [11] and bit error [14] measurements. 

The static linearity can be improved using many techniques as discussed, but the 
speed improvement is not straightforward. Although digital calibration can improve 
the resolution of the pipeline ADC to over 15b, the settling time of the interstage 
residue amplifier has been the bottleneck in achieving high conversion rate. 

15.3 OUTLOOK 

New circuit techniques and architectures will lead to the implementation of data con- 
verters for digital IF applications. Assume the goal is to realize an ADC that meets 
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the 14 b, 50MHz requirement using low-power CMOS technologies. New high-reso- 
lution CMOS ADC architectures need to be explored so that high linearity and speed 
can be readily obtained with less hardware complexity. The following are identified 
as major factors limiting performance in two candidate architectures. 

■ Pipeline ADC 

- Speed of the residue amplifier 

- Linearity and calibration 

■ Folding ADC 

- Offset of the folding amplifier 

- Complexity for high resolution 

- Linearity and calibration 

In the pipeline ADC, the settling time of the interstage residue amplifier has been 
the bottleneck in achieving high conversion rate. The system can be reconfigured so 
that the interstage gain may be obtained using a transresistance amplifier to speed 
up. Also the resolution of the folding ADC can be improved using a subranging con- 
cept combined with cascaded folding. Subranging will greatly simplify the folding 
architecture, and the idea of calibrating the folding ADC is feasible with manage- 
able complexity. These two approaches have never been applied to ADCs with more 
than 14 bits. To achieve such a linearity at over 50MSample/s rate using CMOS, we 
have to overcome numerous technical barriers. The ADC of this caliber has yet to be 
demonstrated. An oversampling delta-sigma calibrator will enable all calibrations in 
the background. 

Fig. 15.8 illustrates a residue amplifier that can be used as a front-end of the pipeline 
ADC. The system uses a current -domain residue amplifier, which has an added advan- 
tage of using a current DAC which can be controlled more accurately at high speed 
than voltage DACs. In the proposed architecture, linearity is limited only by the match- 
ing between current sources and resistors. Since the matching in IC processing is not 
enough, we need to calibrate current sources in the background using an oversam- 
pling delta-sigma calibrator [15]. The ultimate limit of the conversion rate is set by 
the settling time of the residue amplifier. For example, if the amplifier settles within 
10ns with a 12 b resolution, the closed-loop time constant is about 1.1 ns. This in turn 
requires a closed-loop bandwidth of about 145 MHz. This closed-loop bandwidth is 
believed to be well within the reach of the current CMOS technologies. The system 
differs from others having the following distinct features. 

■ Current-domain residue amplifier for fast settling 

■ Background calibration of current-domain MDAC 

■ Oversampling delta-sigma calibrator 

The overall system works as follows. The input sample-and-hold amplifier makes 
a new sample ready for the residue amplification. The coarse digital values are also 
ready to switch differential current DAC elements for the residue amplification. An 
LSB stage is also assumed to be ideal within its resolution. One left-over phase of the 
residue amplifier is used to calibrate the current DAC elements, where the output of 
the residue amplifier corresponding to V re f is compared to the ideal V re f using a delta- 
sigma modulator. The bitstream output from this modulator is counted and averaged 
to increment or decrement a small current DAC to trim the current source. Behavioral 
simulation results of the quantized output show that without calibration, the spurious 
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Figure 15.8 Pipeline ADC front-end with background-calibrating current DAC. 



level approaches -70dB assuming 0.2% component mismatch. However, with the 
calibration engaged, the spurious level is suppressed to below - lOOdB. 
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Figure 15.9 A background-calibrating pipeline folding ADC with subranging and in- 
terpolation. 
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The folding architecture has been most frequently adopted for high-speed ADCs. 
The folding circuit simultaneously produces many zero crossing points, but they need 
to be as accurate as the resolution of the ADC. Two limiting factors are the accuracy of 
the folding thresholds and the linearity of the folding amplifier. Due to the difficulty 
of controlling these, folding architectures have been known for high speed but for 
resolution typically lower than 8 to 10 bits [18-20]. However, at a 14b level, the 
system also becomes prohibitively complex, and a new efficient folding architecture 
is needed. In addition, the new architecture should be calibratable for high linearity. 

Several improvements can be made on the basic folding and interpolation architec- 
ture to overcome its limitations. The following features can increase the number of 
bits effectively. 

■ Cascaded folding with an odd number of folding blocks 

■ Subranging concept in folding 

■ Background calibration of the reference and folding amplifier offsets 

■ Oversampling delta-sigma calibrator 

For cascading, it is necessary to generate folded signals from the first folding am- 
plifier, and a Gilbert-type multiplier circuit has been used [3]. However, the multiplier 
requires level shifters and cascode circuits that make the implementation more com- 
plex and introduce errors in the folding process. This difficulty can be overcome by 
using the same current switch as in the first folding amplifier for the intermediate 
stage. For this, the folding scheme needs to be modified so that the number of folding 
blocks becomes an odd number. If the number is a multiple of an odd number, the 
second folding outputs can be generated by summing the first folding outputs. Since 
the summation of such outputs can be implemented using current switches, the same 
circuitry used for the first folding amplifier can be used repeatedly. Using this and the 
subranging concept, one solution to a calibratable cascaded folding architecture can 
be configured as shown in Fig. 15.9. The number of output codes is not a power of 
two, but the number of folding current switches is reduced to only 84. The first-stage 
folding amplifiers’ offsets are of interest and subject to calibration. 

The linearity of the resistor ladder network can be calibrated without interfering 
with the normal conversion using a delta-sigma calibrator [15]. An offset of the folding 
amplifier has the same effect as the reference error, and it is the main source of errors. 
To achieve 50MSample/s conversion speed, it is necessary to have the reference plus 
folding outputs settle within 10ns. To guarantee high resolution, a calibration scheme 
to cancel the folding amplifier offsets should be incorporated. Fig. 15.10 shows such 
an implementation of the folding current switch. When one folding current switch is 
under calibration, its output is separated from the second folding block and fed into 
an error measurement block. The input to the folding amplifier under calibration is 
shorted to null the output. The output necessary for the second folding block is gen- 
erated by interpolation. The error measurement block measures the folding amplifier 
output, and the calibration logic adjusts the bias current of the folding current source 
until the output of the folding amplifier is nulled. Like most pipeline ADCs, redun- 
dancy can be used in the folding architecture for digital correction. A prototype of the 
LSB 10 b ADC to be used in this architecture is implemented separately to prove the 
speed of this architecture. Fig. 15.11 shows the 19 MHz input sampled at 157 MHz, 
and its reconstructed waveform shows distinct steps. 
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The missing FA2 is inserted by Interpolation 



Figure 15.10 Folding amplifier offset calibration scheme. 
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Figure 15.11 A/D-D/A outputs of a 1 0 b pipeline folding ADC. 



15.4 CONCLUSIONS 

Digitization is now about to move into IF for wireless systems. The software-pro- 
grammable radio offers the technically most challenging tasks to the data acquisition 
design community. Only high-performance ADCs and DACs, implementable in low- 
cost VLSI technologies and consuming one order of magnitude less power than current 
circuits, will make the wide-spread use of software radios feasible. CMOS data con- 
verters are not yet in this arena. However, two examples of high-resolution ADCs 
shown here prove that even CMOS ADCs can move into the 100 MHz range with high 
resolution if new robust architectures are developed. The current ADC art, estimated 
by recent works, is projected to achieve a linearity of 15 b to 16 b with a SFDR of over 
90 dB at a 100 MHz range in the near future. 
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16.1 INTRODUCTION TO THE FOLLOWING PAPERS 

Today, silicon CMOS, bipolar and BiCMOS technologies dominate the field of dig- 
ital and low-frequency analog monolithic integrated circuits. At RF frequencies of 
500MlTz and above, the choice of technology is no longer so straightforward, as other 
technologies based on III-V semiconductors (GaAs, InP) or SiGe heterojunction de- 
vices show distinct advantages at least for certain building blocks like RF power am- 
plifiers. The advantages and disadvantages of each of these technologies with respect 
to performance, cost, ease of design, etc. are currently a subject of hot debate, both in 
industrial companies and academic circles. Furthemiore, the discussion with regard to 
the optimum choice of integrated circuit technology for a certain part of an RF radio 
system partly overlaps with the — no less controversial — debate on the desirability 
and usefulness of the so-called “single-chip radio.” The overlap of both controver- 
sies about technology and the “right level of integration” also shows up in the three 
contributions of this part. 

Some contributions in this volume deal with aspects of the single-chip radio mainly 
from a CMOS point of view. This seems natural because most of the push towards 
the “single chip radio” currently comes from the people working their way up from 
CMOS baseband processing to include more and more RF functions. A glance at 
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the current publications on single-chip RF radios, which are too numerous to be cited 
here, reveals, however, that most RF single chips presented still lack very important 
parts of a complete transceiver, such as the final power stages, RF switches, selective 
filters, and, sometimes, oscillators. 

The extent to which all these functions can be integrated in the future depends not 
only on the advances made by RF circuit designers, but also on the general develop- 
ment of the silicon technology, which is driven by digital applications. It is not clear 
whether the trends in digital CMOS, e.g. towards ever lower supply voltages (mainly 
due to reduced gate length) and oxide thickness, do not contradict the requirements of 
the RF circuit designer. 

The first contribution, written by C. A. King, highlights the importance of low- 
cost processing in advanced BiCMOS technology development. With a “CMOS first” 
philosophy, the additional processing steps for fabricating the bipolar devices are min- 
imized. Nevertheless, a peak fj for the bipolar devices of 30 GHz is achieved. Using 
similar device structures with the same “CMOS first” philosophy, Si/SiGe heterostruc- 
ture bipolar transistors have been fabricated with a peak cut-off frequency of 54 GHz. 
The process steps for device fabrication are shown and some properties of the Si/SiGe 
material system are discussed. The evaluation of the RF device performance and the 
extracted delay elements show that further improvements in fj should be possible by 
reducing the junction capacitances. 

R. Christ’s contribution compares GaAs-Metal-Semiconductor-Field-Effect-Tran- 
sistor (MESFET) RF IC technology with competing technologies seen from the view- 
point of a commercial GaAs IC supplier. The transmitter linearity requirements in 
advanced code-division multiple-access (CDMA) systems such as IS-95 are consid- 
erably higher than in the GSM (global system for mobile communications) system. 
Today, the IS-95 specifications can be met more easily with power amplifiers based 
on the GaAs MESFET or GaAs Heterojunction Bipolar Transistor (HBT) than with 
silicon devices. However, it is not clear whether the HBT or MESFET will be more 
economical, as HBT technology generally has a higher cost per wafer, but achieves 
smaller die size for a given power output. In the following discussion of the single 
chip radio, it is stated that full integration for its own sake makes no sense commer- 
cially. Aspects like development time, product life time, and design flexibility often 
call for less-than-maximum integration levels in commercial RF chip sets. Finally, 
the key parameters of a state-of-the-art GaAs RF MESFET process featuring enhance- 
ment and depletion MESFETs, high-Q inductors (> 17 at 1.9 GHz), MIM capacitors, 
and NiCr resistors are described. 

In the third paper by U. Lott and W. Bachtold, different RF IC technologies are 
compared regarding their frequency range of application and suitability for RF and 
baseband integration. Thereafter, examples of RF ICs working at 2 GHz and above 
are described. Ultra-low-power, low-noise amplifiers (LNAs) have been realised in 
both a silicon bipolar and a GaAs MESFET technology at 1.9 GHz. Another LNA 
including the switching function for a three-input-antenna diversity system at 4.5 GHz 
has demonstrated low noise and low power performance. Finally, a 200 mW power 
amplifier for 17 GHz using High-Electron-Mobility-Transistor (HEMT) technology is 
shown. These examples highlight the potential of advanced semiconductor technolo- 
gies for realising integrated circuits at multi-GHz frequencies, where RF CMOS will 
lag behind GaAs MESFET technology for several years to come. 
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Abstract: BiCMOS for analog circuits is finding applications for which the high 
transconductance, low noise, high output resistance, and additional speed of the bipo- 
lar devices yield a performance not easily obtained through a CMOS technology alone. 
However, for BiCMOS to remain a viable alternative for these applications, costs must 
be kept low. Our BiCMOS approach, MBiC (Modular BiCMOS), starts with a state-of- 
the-art CMOS technology and adds a bipolar transistor in a modular fashion while 
minimizing additional process steps. Using process steps such as high-energy implan- 
tation in place of epitaxy over a heavily doped sub-collector, and amorphous-Si refill 
techniques under the extrinsic base region, we produce Si homo-junction devices with 
cutoff frequencies of about 30 GHz with only three additional masking levels. Using 
similar device structures with the same “CMOS first” philosophy, we also fabricated 
Si/SiGe hetero-junction bipolar transistors with a cutoff frequency of 54 GHz. 
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17.1 INTRODUCTION 

Si BiCMOS technology is important for wireless applications since it provides low- 
noise transistors with low power dissipation at microwave frequencies along with 
the state-of-the-art CMOS needed for logic applications. This capability means that 
BiCMOS technology provides the most likely opportunity to combine the radio and 
signal-processing portions of wireless circuitry to make single-chip phones possible. 
While CMOS alone may provide this capability at the lower bands in the future, mar- 
kets will not wait for CMOS technology to evolve before striking a demand. There- 
fore, the development of BiCMOS technologies will continue for current wireless com- 
munication applications and for applications in emerging frequency bands. 

17.2 SILICON BiCMOS 

BiCMOS process development at Lucent Technologies follows a modular approach. 
The bipolar transistor is integrated into an existing CMOS process without changes 
to the core process so that re-qualification of the MOS devices is not necessary. In 
addition, the inclusion of the bipolar transistor is accomplished by adding as few litho- 
graphy levels and processing steps as possible, in order to keep costs low. Techniques 
such as high-energy implantation for sub-collector formation and poly-silicon refill- 
ing for extrinsic base contacts help to eliminate unnecessary processing steps while 
preserving device performance. 

1 7.2. 1 High-energy-implanted sub-collector 

To reduce collector resistance in bipolar transistors, a buried layer, or sub-collector 
layer, is generally used. This low-resistivity layer is formed by implantation and dif- 
fusion into patterned areas on the substrate. These regions are subsequently covered 
using epitaxial growth by a layer having much higher resistivity to form the active col- 
lector region. A high-energy implantation process which eliminates many steps and 
therefore reduces cost and process complexity can replace this diffusion and epitaxy 
process. Table 17.1 illustrates the device performance for the 0.25 pm High-Energy 
Implanted BiCMOS (HEIBiC) process of Lucent Technologies [1]. The change in 
the cutoff frequency is strongly dependent on the breakdown voltage and thus on the 
collector depletion layer transit time. A trade-off between the collector-emitter break- 
down voltage and the cutoff frequency is clearly evident from Table 17.1. 

17.3 LATERAL ETCHING AND AMORPHOUS SI REFILLING PROCESS 

Lateral etching of Si02 is important for the fabrication of electronic device and micro- 
machine stmctures. Micro-machines, which are mechanical surface or bulk devices, 
use the lateral etching of Si 02 to free overlying stmctural layers made of polycrys- 
talline Si or Si 3 N 4 . Typical thicknesses of the oxide layer to be etched for micro- 
machine fabrication are in the range from 0. 1-0.5 pm, while the lateral etch distance 
can be hundreds of pm. For the fabrication of high-performance transistor structures, 
lateral etching to remove an insulating region and replace it with a semiconducting 
region is a useful process to lower costs and improve performance. In these devices, 
a cavity is first created by the removal of the insulator, usually SiC> 2 , using a solution 
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Table 17.1 HEIBiC device performance. 
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Figure 17.1 An SEM micrograph of a laterally etched trench feature with 6nm of 
thermal oxide under the poly-silicon, WSi* , and nitride layers. 



of HF acid. Subsequent to the lateral etching process, the cavity is filled with amor- 
phous Si (a-Si) or epitaxial Si to create an electrical contact to underlying layers. This 
lateral-etch-and-refill procedure is especially important in Bipolar-CMOS (BiCMOS) 
processes where integration of a super self-aligned bipolar transistor with the MOS 
devices must be accomplished with a minimum of steps to keep costs low [2,3]. 

In this work, we created extremely small cavity structures (down to only 2.5 nm) 
and refilled them with a-Si. To the authors' knowledge, these are the smallest struc- 
tures to be laterally etched and refilled. In addition, we report the etching and refilling 
characteristics of such small structures and compare the results to previous work. Prior 
work investigating the etch rate dependence on the cavity height, which is determined 
by the thickness of the oxide layer, is inconsistent. One group finds a dependence [4] 
while another does not [5]. In our work with very thin oxide layers, we see a depen- 
dence of etch distance on cavity height in the thickness range of 50 nm down to 2.5 nm. 
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In addition, we checked the dependence of the lateral etch distance on window open- 
ing size and trench opening size in the range of 0.18-0.65 pin using scanning electron 
microscopy (SEM). After creating cavities of various dimensions with lateral etching, 
we refilled them with an a-Si layer and examined the final stmcture with SEM and 
transmission electron microscopy (TEM). 

The fabrication of test structures began with conventional 150mm Si (100) p/p-i- epi 
substrates. We started the process by creating a stack on the wafer resembling a metal- 
oxide-semiconductor (MOS) gate or a bipolar junction transistor (BJT) extrinsic base. 
From the substrate, the stack consists of SiC> 2 , polycrystalline Si, WSi x , and silicon ni- 
tride. The stack formation process began with thermal oxidation. The initial oxidation 
step was carried out in a conventional vertical oxidation furnace except for the 2.5 nm 
oxide which was grown with rapid thermal oxidation [ 6 ]. Following the oxidation, we 
deposited a-Si at 550°C to a thickness of lOOnm using low-pressure chemical vapor 
deposition (LPCVD). Next, WSi* was sputtered to a thickness of 120 nm on top of the 
a-Si. To finish the stack formation, we used LPCVD to deposit 100 nm of SHN 4 . The 
stack was then patterned using 248 nm deep UV lithography with a mask comprising 
trench and window sizes ranging from 0.18 to 0.65 pm. The trench patterns have a 
width equal to the minimum size dimension and a length of at least 1 mm while the 
window features are circular with a diameter equal to the minimum size dimension. 
After photo-lithography, we etched the nitride using reactive ion etching and stripped 
the masking photoresist. Using the nitride as a mask, we continued to etch the WSi* 
and polycrystalline Si (converted from a-Siduring nitride deposition) layers with an 
etch that is highly selective to oxide. This selective etch allowed us to stop on the thin 
underlying oxide (2.5 to 50 nm) [ 6 ] and complete the test structure fabrication. With 
the trench and window structures now formed on the wafer, we immersed the sam- 
ples for 70 minutes in a recirculating bath of 0.49-weight-percent (wt. %) HF solution 
controlled to a temperature of 21°C. The etch rate of blanket thermal oxide layers in 
this bath is about 3 nm/min. Fig. 17.1 shows an SEM micrograph of a trench feature 
after lateral etching in HF on a sample with a 6 nm thermal oxide. The lateral etch 
distance is clearly demarcated by the undercut visible in the picture. The polycrys- 
talline Si, WSi*, and nitride layers are also clearly discernible. In Fig. 17.2, we plot 
the lateral etch distance in both trenches and windows for samples with underlying ox- 
ide thicknesses of 6 and 20 nm. For a given feature (i.e. trench or window), the lateral 
etch distance is approximately independent of feature size from 0.18 to 0.65 pm. For 
20 nm underlying oxide thickness, the average lateral etch distance in trench features 
is 183nm, while it is 166 nm for window features. A much larger disparity of the 
lateral etch distance between trenches and windows is apparent when the underlying 
oxide thickness is reduced to 6 nm. In this case, the average lateral etch distance for 
trench features is 119 nm, and for windows it drops to only 84 nm. 

Previously published results for lateral etching of much thicker oxide layers are in 
conflict with each other. Two groups working with nearly identical etching conditions 
(test structure, etch monitoring scheme, etc.) came to opposite conclusions regard- 
ing the dependence of etch rate on oxide thickness. Monk et al. observed almost no 
difference in the etching characteristics of structures with 134nm and 2.36 pm oxide 
layer thicknesses using a concentrated HF solution (49 wt. %) [5]. On the other hand, 
Liu et al. observed a large dissimilarity in the etching behavior of oxide channels with 
layer thicknesses between 240 nm and 1 .7 pm using a less concentrated 26.5 wt. % HF 




LOW-COST SiAND Si/Sii-xGe* BiCMOS FOR WIRELESS APPLICATIONS 



221 




0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 
Window or Trench Size (iim) 

Figure 17.2 The lateral etch distance for two samples with 6 and 20 nm oxide thick- 
ness with window and trench features of various sizes. The samples were etched for 
70 min. in 100:1 H 2 O : HF (0.49 wt. percent). 



solution [4]. Liu attributed the thickness dependence of the etching to a liquid-solid 
interaction between the silicon nitride over-layer and the HF solution. In our very thin 
oxide layers (less than 20 nm), we also see a pronounced reduction of the etch rate 
with oxide thickness. It is not readily apparent that the oxide should be etched at all 
given the extremely small dimension of our thinnest sample (2.5 nm) which is on the 
order of only 5 Si lattice constants. The etch solution may be forced into the cavity by 
capillary action or it may be drawn in by the hydrophilic nature of the oxide surface. 

The lateral etching of these cavities appears to be roughly described by simple mod- 
els for capillary flow. The speed of liquid flow into a capillary can be approximated 
by [7] 

_ Rh(YSV-YSL.) 

dt 4 rjz 

where R H is the ratio between the volume and surface area of the cavity, ysv and ysl 
are surface free energies between the solid-vapor and solid-liquid interfaces, and r\ is 
the viscosity of the liquid. For a rectangular cross-section capillary with a length much 
greater than its height, the liquid penetration after time t is given by 

z 

where h is the cavity height. Therefore the extent of the lateral etch after time t is 
proportional to the square root of the cavity height. Based on the data of Fig. 17.2, the 
ratio of lateral etch depths for the two different cavity heights of 20 and 6 nm varies 
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between 1.5 and 2, while the square root of the ratio of the cavity heights is 1.8. This 
result suggests that capillary action may be important for the etching of these small 
structures, however, further investigation is necessary. 

For transistor applications, the cavity formed by the lateral etch of the oxide must 
be refilled with a conducting film to facilitate a contact to the crystal region. As the 
gate length of Si CMOS technology scales downward, the gate oxide thickness of the 
transistor also scales downward. In order to achieve the maximum cost reduction in 
a BiCMOS technology, the lateral etch and refill process must be accomplished using 
the gate oxide to form the cavity. Therefore, we studied oxide thicknesses down to 
2.5 nm to cover the low end of the range expected for the next few generations of 
CMOS technology. 

In our test structures, we refilled the cavity with a-Si deposited at 550°C using 
LPCVD at 250mTorr. Fig. 17.3 exhibits the TEM results from the refill of a-Siinto 
gaps ranging from 50 nm down tojust 2.5 nm. Even in the thinnest sample, the refill of 
the cavity is complete without the appearance of large voids, which demonstrates that 
the deposition process is extremely conformal. This result shows that this technique 
indeed has great promise for future generations of BiCMOS technology. In the sam- 
ples with cavity thickness greater than 2.5 nm, the convergence of the top and bottom 
growth fronts is visible. In the 2.5 nm sample, however, the border of the growth fronts 
is not visible due to taper introduced into the gap before the refill process. This taper 
may come about from native oxide formation and removal. In all the samples, one can 
notice that the refilled region is slightly thicker than the oxide thickness, which may 
be due to native oxide formation along the top and bottom of the cavity walls while 
awaiting a-Si deposition. The native oxide is removed by a short etch in dilute FIF 
immediately prior to deposition, leaving behind a larger cavity, which is most easily 
seen in the TEM images of the thinnest samples. We also note that other experiments 
show that the border created by the coalescence of the top and bottom growth fronts 
disappears completely in all samples after a rapid thermal anneal (RTA) at 1050°C for 
10 seconds. 



17.4 Si/Sii-jGe* HETEROSTRUCTURE BIPOLAR TRANSISTORS 

The gain in a Si bipolar transistor is expressed as 



P % 



DbxeNe 
D e W b N b ' 



(17.3) 



where D is the minority carrier diffusion coefficient, *£is the emitter thickness, Wgis 
the base thickness, and N is the doping concentration. Modem Si bipolar devices are 
quickly approaching vertical scaling limits with base widths in the order of 50-80 nm 
and base doping levels well above 10 18 cm -3 . The desire to reduce the base thickness 
further to increase device speed (fy) is hampered by two fundamental constraints. 
First, the base doping level cannot be raised substantially above about 3 • 10 18 cm -3 
because of the onset of high tunneling currents at the emitter-base junction. A higher 
base doping level is required to avoid punch-through between the emitter and collector 
and is desired to minimize base resistance (increase / max ). Secondly, the need to 
maintain sufficient current gain requires that the emitter doping (which is already at 
the solid solubility) be substantially higher than the base doping. 
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Figure 17.3 TEM micrographs of laterally etched cavities that have been refilled with 
a-Si. The cavity height (oxide thickness) is 2.5 nm in (a), 6nm in (b), 20 nm in (c), and 
50 nm in (d). 



By placing a hetero-junction at the base-emitterjunction, these constraints can be 
considerably relaxed allowing continued scaling of bipolar devices. The expression for 
current gain of a hetero-junction bipolar transistor (HBT) is modified in the following 
way, 



DbxeNe 

DeWbNb 



exp 




(17-4) 



where A E g is the bandgap discontinuity between the large-gap emitter and smaller- 
gap base material. As AE g approaches several AT. the doping levels in the emitter 
and base can be completely reversed, allowing a heavily doped base layer to coexist 
with a lightly doped emitter region. This enhancement in the injection efficiency by 
using dissimilar materials in the emitter and base regions permits bipolar devices to 
be scaled vertically while improvements in lithographic technology allow them to be 
scaled laterally. 



17.4.1 Epitaxial gro wth and material properties of Sii- X Ge x 

Sii-^Ge* alloys represent an example of a perfectly miscible system, and therefore 
all compositions can be epitaxially grown as a single-crystal diamond lattice. Under 
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certain growth conditions on silicon, such alloys will grow as isolated “islands” rather 
than as a continuous two-dimensional plane. This three-dimensional growth is appro- 
priately termed “island growth”. Early molecular beam epitaxy (MBE) investigations 
into the deposition of Ge on Si repeatedly encountered problems with island growth. 
Two-dimensional planar growth was not achieved until Kasper demonstrated that the 
islands disappear when Sii-jcGe* alloys are grown on Si with Ge mole fractions less 
than 20 % [8—10]. Bean et al. later discovered that all compositions ofSii-^Ge* will 
grow island free, if the substrate temperature is kept within a window enclosed by 
polycrystalline growth on one side and island growth on the other [11]. They found 
that a substrate temperature of 550°C is within this window for all Ge concentrations. 
The onset of island formation is dependent on the germanium fraction in the alloy 
and occurs at increasingly higher growth temperatures as the germanium fraction is 
lowered. This ground-breaking discovery paved the way for future implementations 
ofSii-jGe* alloys in devices. 

Once two-dimensional planar growth is attained, there are still two options for the 
manner in which the alloy can grow on the Si substrate: strained and unstrained. In 
the case of an unstrained Sii-^Ge* alloy on a Si substrate, the larger Sii-^Ge^equi- 
librium lattice constant is maintained and, due to the difference in registration with the 
Si substrate, misfit dislocations occupy the inter-facial region. Misfit dislocations add 
energy to the system in the form of dangling bonds, making the other mode of planar 
growth, strained layer growth, energetically favorable under certain conditions. 

For the case of strained layer growth of Sii-^Ge* on a Si substrate, the in-plane lat- 
tice constant of the alloy layer compresses to match that of the underlying Si substrate. 
Because of the relative thicknesses of the epitaxial layer and the substrate, the change 
of the lattice constant of the Si substrate is negligible. Strain also adds energy to the 
system since the bond angles are altered. Strain energy increases with layer volume. 
Consequently, at some layer thickness, the energy to maintain strain in the growing 
layer will overcome the energy to produce misfit dislocations at the interface. Many 
researchers explain strained layer growth using equilibrium analysis. From equilib- 
rium calculations, one may find a critical thickness at the point where layer growth 
reverts from strained to unstrained. 

Fig. 17.4 shows the observed critical thickness for a Si-capped Si i-^Ge* alloy layer 
on Si as a function of Ge mole fraction. Grown thicknesses above the curve indicate 
meta-stable or unstable layers that are prone to misfit dislocation formation. Layer 
thicknesses at or below the curve are completely stable and will not relax through the 
process of misfit dislocation formation. Relaxation can occur, however, through the 
process of Ge diffusion, which transpires at sufficient thermal treatments (1000°C for 
several minutes). On the other hand, boron diffusion of the base is a greater concern 
for HBTs at these thermal budgets. 

From Eq. (17.4), we noted that the bandgap discontinuity, AE g ,is a very impor- 
tant parameter to determine the injection efficiency enhancement of an HBT device 
stmcture over that of a Si homo-junction structure. Fig. 17.5 displays the bandgap 
discontinuity for both strained [12] and unstrained [13] Sii-^Ge.* as a function of Ge 
composition. In the unstrained material, the bandgap changes rather slowly with Ge 
composition until a value of about 85 % is reached, at which point the bandgap dif- 
ference rises quickly. In the case of strained material, A E g is large with even small 
fractions of Ge and continues to rise. When the Ge fraction reaches values of only 
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Figure 17.4 Critical thickness vs. Ge alloy composition. 



60 % in the strained layer, the actual bandgap is smaller than that of pure Ge! There- 
fore, not only it is important for HBT applications to maintain complete strain in the 
alloy layer to avoid misfit dislocations, but also to obtain the maximum A E g possible 
for a given Ge fraction. 




Figure 17.5 Bandgap difference for strained and unstrained SiGe as a function of Ge 
composition. 



17.4.2 Sli- x Ge x bipolar transistor structures 

Since the first results that appeared in 1988, many researchers have begun examin- 
ing the prospects of Sii^Ge* HBTs. Because other hetero-junction systems have 
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been studied for years, many of the ideas conceived for HBTs in those materials are 
being applied to the Si/Si^Ge* system. Compositional grading at the devicejunc- 
tions, compositional grading in the neutral base, and the use of a lightly doped emitter 
are among some examples of ideas retrofitted for Si/Si^Ge* hetero-junctions. In 
addition, since this hetero-junction system is silicon based, researchers apply many 
concepts from silicon technology as well. Processes and ideas include poly-silicon 
emitters, ion implantation, rapid thermal and furnace annealing, and selective epitax- 
ial growth. Most of the work to date can be divided into two groups. The first class 
of devices contains standard HBTs which utilize a single-crystal emitter and a base- 
emitter hetero-junction to achieve an increase in injection efficiency. The devices in 
the second class use a poly-silicon emitter to increase injection efficiency and grading 
of the bandgap in the neutral base region to introduce an electric field to speed the 
transport of minority carriers across the base. 

Graded-baseSii-jGej,; bipolar transistors. Patton et al. [14,15] first introduced 
devices employing a poly-silicon emitter with a graded Ge profile. These devices use 
a Ge profile in the base that is graded from 0 % at the base-emitter junction up to a 
value of about 15 % at the base-collector junction. A secondary ion mass spectrom- 
etry (SIMS) plot of the Ge and boron profiles for this graded-base process is shown 
in Fig. 17.6. The graded Ge profile introduces a built-in electric field in the quasi- 
neutral base that reduces the transit time of injected electrons across the base thereby 
increasing / T . The vertical doping profile, and thus the base sheet resistance, can- 
not be altered from the homo-junction case, since the band offset at the base-emitter 
junction is near 0 eV. 




Death (nml 



Figure 17.6 SIMS profile of graded SiGe base bipolar transistor. 



Although grading the bandgap in the neutral base region lowers the base transit 
time, it does little to affect issues that address bipolar scaling. Since no hetero-junction 
is formed (aside from the poly-silicon/single crystal Si interface) at the base-emitter 
junction, base doping and current gain are still intimately coupled. Clearly, bandgap 
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grading increases /f over a homo-junction device, but it only weakly enhances injec- 
tion efficiency. Therefore, further scaling of the base width cannot be accomplished 
with neutral base bandgap grading alone. For high-speed bipolar circuits, both low 
base resistance and high cutoff frequency are important. Graded-base devices provide 
a good initial step for using SiGe, since existing bipolar processes can largely be used 
requiring only the exchange of an implanted base with an epitaxially grown one. The 
true performance limits of SiGe bipolar devices will be attained with Si/SiGe HBTs. 

Si/Sii_ x Ge x hetero-junction bipolar transistors. The strained-alloy bases with 
much higher Ge fractions (-30%) [16, 17] which are used in Si/SiGe HBT struc- 
tures allow much higher base doping (10 19 to 10 20 cm -3 ). Si/Si i- x Ge x HBTs have 
demonstrated impressive figures of merit for both frequency response and noise per- 
formance [18]. However, these transistors were fabricated using blanket epitaxy cou- 
pled with a double-mesa process that makes integration very difficult. In our work 
at Lucent, we demonstrate a device structure usable for high-level integration which 
utilizes an aggressive vertical profile (W B = 18nm, N B — 5 ■ 10 18 to 2.5 - 10 19 cm -3 
and x = 30 %) in a planar transistor. 




Figure 17.7 Si/SiGe HBT device structure. 



Fig. 17.7 shows our device structure. We used rapid thermal epitaxy (RTE) to grow 
the layer structure, utilizing the limited-reaction-processing mode of growth [19]. A 
secondary ion mass spectrometry (SIMS) plot after the device fabrication of a struc- 
ture with N b = 2.5 ■ 10 ,9 cm -3 is illustrated in Fig. 17.8. The very narrow base width 
is approximately 18 nm, which is below the equilibrium critical thickness for a capped 
SiojGeoj layer (Rgi ~ 2.0fc £2/D for N b = 2.5 x 10 19 cm -3 ). The boron base dop- 
ing is contained within the strained layer, indicating that transient enhanced diffusion 
effects are not important. The As profile shown in Fig. 17.8 is interrupted in the 
Sio. 7 Geo .3 layer due to mass interference effects between Ge and As. These effects 
also lead to an exaggerated value of the As concentration close to the base-collector 
junction. 

To evaluate the high-frequency performance of these transistors, we measured the 
S-parameters in common-collector mode from 2 to 18 GHz using a vector network an- 
alyzer and separated the effect of the parasitic pad capacitance using the Y-parameter 
subtraction technique. Devices with N B = 5 x 10 18 cm -3 yielded / T = 54 GHz, and 
with N B =2.5 x 10 l9 cnr 3 , fy = 44 GHz for V C E = 1.5 V. Extraction of the intrinsic 
delay using r ec vs. /jj 1 for N B = 2.5 x 10 19 cm -3 yields an intercept of 2.9 ps and a 
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Figure 17.8 SIMS profile of HBT structure after all processing. 



slope corresponding to Cbc + Cte = 209fF. Table 17.2 shows the delay elements for 
this device. From this data, we estimate that the collector charging time accounts for 
about 72% of the total delay, leaving much space for improvement in / T and / max , 
since the intrinsic delay is extremely small due to the thin 18 nm base width. 



Table 17.2 SI/SiGe HBT delay components. 



Component 


Delay 


Tb + t d 


l.Ops (28.1%) 


*We(Cbc + C B e) 


0.67 ps (18.9%) 


(Rc + Re)Cbc 


1.88 ps (53%) 



17.5 SUMMARY 

Low-cost Si BiCMOS will continue to be an important player in the wireless commu- 
nication equipment business especially with the push toward lower power and higher 
integration. However, Si bipolar devices are reaching fundamental limits in terms of 
base doping and thickness which can be alleviated by the use of SiGe bipolar devices. 
Si/SiGe HBT technology has progressed steadily since the first reports. Indeed, initial 
products are now beginning to emerge, and with the demand for wireless goods ex- 
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peeted to expand in the coming years, the need for this technology will surely grow. 
Graded-base devices are a first step to increase the cutoff frequency, but further de- 
vice scaling will require an HBT structure with a high Ge fraction at the base-emitter 
junction. In this paper, we introduced a process which utilizes selective and non-selec- 
tive rapid thermal epitaxy for growing a structure usable for high level integration and 
which uses a high Ge fraction of 30 % thereby permitting high base doping. High-fre- 
quency measurements of a 0.5 x 10 pjn 2 transistor yielded a peak fj of 54 GHz. The 
performance is expected to be greatly enhanced by reducing the junction capacitances 
(Cbc and Cte) through minor changes in processing conditions. 
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Abstract: GaAs-based IC technologies are now firmly established in mobile phones 
and wireless data transmission applications. The success of GaAs MESFET RFICs in 
recent years comes from the need for improved performance for digital standards and 
from a significant reduction of the manufacturing costs for wafer fabrication. While 
there is little doubt that the GaAs IC industry is important today, there are differing 
opinions about the role of RFIC technologies in future wireless systems. Some believe 
that GaAs MESFET technology will take over the entire RF section of future radios, 
while others believe the present foothold will be chipped away as higher performance 
silicon technologies come on line. Yet others believe that alternate GaAs technolo- 
gies, such as pHEMT and HBT, will gain greater acceptance for specific RF functional 
blocks. A minority believe that “radio on a chip” solutions in silicon will come to pass 
and some think it will include SiGe. In this paper, the author discusses the future of 
GaAs IC technologies in future commercial radio systems. The specific cost drivers 
for various process technologies are described and the key performance criteria that the 
mobile phone makers demand are reviewed. The combination of these factors shows 
trends in system architectures for future mobile wireless systems. 
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18.1 THE RF-INTEGRATION PARADIGM IS DIFFERENT 

As enabling components of the Information Age, wireless technologies are quickly 
moving from mere curiosities and conveniences to indispensable pieces of modem life, 
much in the same way that digital technologies have made the computer the backbone 
of modern enterprise. The advent and maturing of silicon IC process technologies 
have, over the last three decades, made the stunning achievements in modem com- 
puting possible. In hindsight it may seem that the current dominance of CMOS IC 
technologies was inevitable. However, only since the massive worldwide investment 
in CMOS production capacity in the 1980’s has it become clear that the staggering 
achievement represented in Moore’s Law (in which former Intel Chairman Gordon 
Moore stated that the density of digital ICs would double at half the cost every three 
years) could be realized. 

It is natural to extrapolate the success exemplified by Moore’s Law to all subse- 
quent IC technologies, but this ignores the fact that Moore’s Law has not worked for 
high-performance analog and RF ICs. RF applications especially have stubbornly re- 
sisted integration. Some workers in the analog IC community, well represented by the 
participants in this publication, believe that this resistance will be broken by the in- 
evitable improvements in the high-frequency performance of CMOS. This view taken 
to its extreme gives rise to the dream of “radio on a chip” integration. Following the 
digital integration paradigm, there are several assumptions built into the argument for 
ultimate RF integration in CMOS. Some of these assumptions are: 

1. All Silicon is Cheap 

2. Transistor Frequency Performance is the Limiting Factor 

3. “Radio on a Chip” is Possible or Desirable 

It is shown below that these assumptions do not stand up to close scrutiny, and that 
future wireless systems will contain a mixture of silicon and GaAs IC technologies. 

18.2 WHERE IS RF INTEGRATION BEING USED? 

Mobile telephones are currently the highest-volume application for high performance 
RF circuits. Until recently RF front-end circuits have been dominated by discrete 
devices. As the complexity of mobiles has increased and as TDMA and CDMA digital 
systems have come on line, we have seen a dramatic increase in the level of integration 
in the baseband and signal processing functions with the widespread use of DSP and 
integrated silicon for low frequency IF and analog functions. This has resulted in 
a commensurate decrease in the number of components needed for these functions. 
However, the RF portion of the radio has resisted integration until very recently. 

The greatest success for RF integration has been in receiver front-end circuits for 
GSM and DCS-1800 systems. Through the use of very sophisticated circuit design 
techniques in high performance silicon BiCMOS, suppliers like Siemens and Philips 
have successfully penetrated the market with modestly priced ICs. Dual band ICs for 
GSM and DCS-1800 are selling for approximately 2-3 US$ in million-piece quanti- 
ties. These circuits can be built mainly because the GSM standard is undemanding for 
IIP3 (input 3 ld order intercept point) and Noise Figure, and because these ICs make use 
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of a large number of external discrete passive components. Current silicon BiCMOS 
receiver ICs require more than 60 external components, for example. 

Since GSM standards are very well established and mobile handsets are becoming 
more like a commodity, there are few opportunities left for the introduction of new IC 
process technologies. The level of silicon integration in GSM and DCS-1800 handsets 
is likely nearing its peak. It will be in the newer and less widespread standards, such as 
TDMA IS-136 in the US, narrow-band CDMA IS-95 and the new wide-band CDMA 
likely to be used for 3 rd generation systems, where most of the challenging IC technol- 
ogy choices will still be made. Unfortunately for integrated silicon, whether CMOS or 
BiCMOS, the requirements for noise, linearity and power efficiency are much tougher 
than those for GSM. It is easy to have a GSM-centric viewpoint, especially outside 
the US, since GSM has become so dominant with so much popular appeal throughout 
the world. However, it is not a good idea to concentrate solely on GSM when looking 
at trends for future systems. Other systems like IS-136 and IS-95 offer important ex- 
amples of the demands to be put on IC technologies for RF sections of future mobile 
phones. With this in mind, it is illustrative to compare what IC technologies are being 
used in today’s digital mobile phones of different standards (see Table 18.1). 

18.3 GaAs FOR MOBILE POWER APPLICATIONS 

It is noteworthy that the GaAs content of an RF system increases generally as the lin- 
earity requirements of the system increases. This is true for both transmit and receive 
sides of the RF section. Based on the clear demand for maximum power efficiency, 
GaAs has dominated power amplifiers, especially in 3-volt systems. There has been 
some competition from LDMOS (laterally diffused MOS) in integrated modules, pri- 
marily from Japanese manufacturers, but these have been limited to 5-volt systems (or 
3-volt systems with built-in voltage doubler) with lower linearity requirements. 

With existing and emerging TDMA and CDMA systems, adjacent channel power 
(ACP) performance becomes much more important. This is where GaAs MESFET 
technology has a distinct advantage. For fully saturated power amplifiers, power 
added efficiency (PAE) values exceeding 60 % are routinely achieved, exceeding the 
efficiency of Bipolar and LDMOS power amplifiers. Narrow-band CDMA systems 
in particular have very stringent ACP requirements. For these systems, especially at 
3 volts, no other process technology has achieved greater than 35 % PAE in commer- 
cially available circuits. This is a result of the very low “knee” voltage (voltage that de- 
notes the transition from linear to saturated operation of the FET) of GaAs MESFETs. 
State-of-the-art power MESFETs have V^ee = 0.5 V or less, while silicon MOSFETs 
have Vknee > 1.5 V or more. Therefore, as wide-band CDMA systems come into being 
for 3 rd generation systems, it is clear that GaAs MESFET technology will continue to 
have a bright future for mobile power amplifiers. 

Two other GaAs technologies also show significant promise for power amplifiers. 
GaAs based hetero-junction bipolar transistors (HBT) are currently being used in high 
volume power amplifiers for GSM and IS-136 mobiles. The major benefits of HBT 
technology are low leakage, high PAE and reasonable linearity with a single power 
supply. The main disadvantages are the high cost and control issues associated with 
GaAs epitaxy growth. Much of this cost can be mitigated by the smaller die sizes 
achievable in HBT technology. It is not clear at this time whether MESFET or HBT 
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Table 18.1 RFIC Technologies Used in Today’s Mobile Phones. 



GSM 900 


DCS1800 & PCS1900 


Tx PA: 

5 V = LDMOS Module 
3 V = GaAs IC or Module 


Tx PA: 

5 V = LDMOS Module 
3 V = GaAs IC or Module 


Tx UPC & DA: 

BJT or BiCMOS IC 


Tx UPC & DA: 

BJT or BiCMOS IC 


Rx LNA + Mixer: 
BJT or BiCMOS IC 


Rx LNA + Mixer: 
GaAs Discrete 
BiCMOS IC 


TDMA 900 (IS-136) 


TDMA Dual Band 


Tx PA: 

5 V = GaAs Module 
3 V = GaAs MESFET IC 


TxPA: 

2 GaAs MESFET IC 
2 GaAs Modules 
Dual band GaAs IC 


Tx UPC & DA: 

5 V = BJT Discrete 
3 V = GaAs MESFET IC 


Tx UPC & DA: 

5 V = BJT Discrete 
3 V = GaAs MESFET IC 


Rx LNA + Mixer: 
GaAs MESFET IC 


Rx LNA + Mixer: 
Dual band GaAs IC 


CDMA 900 


CDMA 1800/1900 


TxPA: 

GaAs MESFET IC 
HBT IC & Modules 


TxPA: 

GaAs MESFET IC 
HBT IC & Modules 


Tx UPC & DA: 
GaAs MESFET IC 


Tx UPC & DA: 
GaAs MESFET IC 


Rx LNA + Mixer: 
GaAs MESFET IC 


Rx LNA + Mixer: 
GaAs MESFET IC 



will, in the long run, be more economical for power amplifiers. The other GaAs-based 
technology that has promise for power amplifiers is pseudo-morphic high electron mo- 
bility transistors (pHEMT). pHEMTs make the most power-efficient power amplifiers, 
especially at very low voltages and high frequencies. However, since pHEMT power 
amplifiers cannot be made significantly smaller than MESFET power amplifiers, the 
high cost of epitaxy material limits the use of pHEMTs in L-band and S-band mobiles. 
pHEMTs will likely remain in the domain of high frequency systems, at X-band and 
above. 
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18.4 GaAs IN MOBILE RECEIVERS 

Even most “radio on a chip” proponents admit that GaAs has a strong future in power 
amplifiers. However, they feel that small-signal RF parts, especially receivers, are 
the eventual exclusive domain of integrated silicon. Again the picture is not so clear. 
As seen above, standards that require higher linearity (as designated by ACP specifi- 
cations) tend to use GaAs for receiver LNAs plus down-conversion mixers. CMOS 
solutions, as presented by other authors in this book, require ever more elaborate cir- 
cuit techniques to meet even the very modest requirements of GSM and DCS- 1800. 
Again, meeting the linearity requirements of more challenging standards with simul- 
taneously low current draw favors the use of GaAs MESFET technology. The current 
consumption of receiver circuits is critical, since receivers are continuously on in mo- 
bile systems during standby. Today’s extremely long standby times in IS-136 systems, 
for example, are largely due to the use of GaAs MESFET technology in receiver LNA 
plus mixer ICs. The four major specifications that are important for receivers are: 1) 
Noise figure, 2) Linearity (as in IIP3 or ACP), 3) Gain, and 4) Current draw. GaAs 
MESFETs allow the simultaneous optimization of all four of these specs in a way 
not possible in CMOS. For example, the following specifications are only simultane- 
ously possible and commercially available in a GaAs MESFET based IC for 900 MHz 
IS-136: 

■ ACP/ALT > -29dBc/-49dBc 

■ Rx LNA/Mixer IIP > -8.5 dBm 

■ NF<2.7dB 

■ G=18.5dB 

■ Idd < 10mA 

(Refer to TriQuint’s TQ9222 data-sheet.) 

GaAs is not suitable for many of the low-frequency and digital functions of mobile 
handsets, but the technology has significant applicability for Tx up-converters, driver 
amplifiers, power amplifiers, and Rx LNAs and down-converter mixers. If it is so clear 
that technical advantages can be realized by using GaAs MESFET technology, why 
is there so much talk about stopping the spread of GaAs circuits and even replacing 
GaAs in mobile systems? This effort is fueled largely by the silicon digital integration 
paradigm discussed above. 

18.5 TESTING THE ASSUMPTIONS 

The mobile phone and wireless data hardware markets are brutally competitive. Any 
small cost advantage for performance that is “just good enough” is quickly exploited. 
With product life cycles of less than a year, there are continuous opportunities to insert 
new components into these systems. These pressures have led to dramatic improve- 
ments for integrated DSP and baseband functions. But, as mentioned earlier, the RF 
section has steadfastly resisted integration in silicon except in a few limited cases. The 
reasons go to the heart of the assumptions built into the “radio on a chip“ paradigm. 
Let’s examine these assumptions. 

Assumption #1: All Silicon is Cheap. This assumption comes from the notion 
that all silicon processes can take advantage of the economies of scale available to 
digital circuits. For state-of-the-art processing, the lowest cost per mm 2 comes from 
CMOS used for memories and microprocessors. The costs for DRAM processing are 
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fairly well known since these costs are monitored by the United States Department of 
Commerce as part of anti-dumping cases against Japanese and Korean DRAM makers. 
DRAM CMOS costs roughly 4 US cents per mm 2 for 200 mm wafers depending on 
utilization and yields. 

This may be well and good, but digital CMOS processes are not useful for RFICs. 
RF silicon is produced in much lower volumes, typically using 6-inch or smaller 
wafers. Rather than being defect-density limited, RFIC yields are strongly driven 
by parametric specifications. As a result, even for the highest volume silicon RF 
processes the costs per mm 2 are substantially higher. Unfortunately, these costs are 
not as well reported, but judging from foundry prices, these costs are at least 50% 
higher for high volume on 6-inch wafers, up to three or four times the cost for the 
highest-performance (Ft> 20GHz) BiCMOS processes. These costs are similar to the 
cost of GaAs MESFET processing for 4-inch wafers in moderate volume. 

There is little mystery about the cost drivers in the semiconductor industry, whether 
CMOS, silicon bipolar or GaAs. If we assume that the cost of packaging and testing 
is roughly the same whether the chip is silicon or GaAs, it is the cost of the processed 
chip where the real cost differences arise. The cost of a processed RFIC chip is a 
function of: 

■ Materials: wafers, chemicals, metals 

■ Number of Masking Layers: time in the fab, equipment utilization, labor 

■ Fab Capacity Utilization: spreading fixed cost 

■ Yield: wafer level, die level 

The primary disadvantage of GaAs has traditionally been the cost per mm 2 of the 
starting wafer, which can be 10 times the cost of a silicon wafer or about 4 to 5 times 
the cost of silicon epitaxy wafer. However, this is offset by the second component 
of cost. GaAs MESFET processing is significantly shorter than competing silicon 
processes. For example, a GaAs process with 2 LET types, capacitors, thin film resis- 
tor and 3 layers of metal is 14 layers. By comparison, RF bipolar processes with the 
same functionality have 21 layers, and BiCMOS processes run 25 or more layers. The 
main advantage of silicon processing is that it is run in higher volumes with better fab 
capacity utilization. As GaAs moves up the volume curve and to 6 inch wafers, this 
advantage of silicon will evaporate. Finally, yield is often the biggest lever in any fab, 
given the variable nature of yields in any analog processing line. For the larger- volume 
GaAs suppliers, wafer line yields of 80-90 % are common today, equivalent to silicon 
RF processes. 

Assumption #2: Transistor Frequency Performance is the Limiting Factor. 

There have been steady improvements in the frequency performance of CMOS and 
silicon bipolar transistors. As gate lengths decrease to 0.25 pm and below, the cutoff 
frequency (F,)of n-channel MOS transistors exceed 10 GHz, which is in the range of 
suitability for building RF circuits with usable gain. Likewise, RF bipolar processes 
with F, > 20 GHz are now commercially available. In the extreme, some SiGe HBT 
transistors have F t > 60 GHz. While transistor frequency performance is important, it 
is important to look deeper. First, F, is a misleading, some would say irrelevant, para- 
meter for RF circuits as opposed to baseband linear circuits. Instead, F max is a better 
measure of RF performance, since it applies to power gain. F max is proportional to F, 
by the factor of input resistance (Gate resistance for FETs, or Base resistance for bipo- 
lar). So while silicon technologies may have impressive F t values, the high resistance 
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of poly-silicon gates in MOS and thin bases in bipolar, limit the F max values to barely 
more than F, and sometimes even less. It is difficult to find MOS or bipolar transistors 
with F ma x values that exceed 20 GHz. In contrast, the very low gate resistance of GaAs 
MESFETs (typically less than 1 ft) give F ma x values 3 to 4 times F t . For example, a 
0.6 /rm GaAs MESFET with F, = 20 GHz has F max > 75 GHz. The importance of this 
for RF circuits cannot be overstated. High Fmax allows circuits to have enough gain 
overhead to produce very low noise figure with high linearity simultaneously. 

But even high F ma x is not enough to produce competitive RF circuits. The other im- 
portant factor is the influence of parasitics. The conductive silicon substrate severely 
limits the ability to build high-Q passive structures and to achieve isolation on chip. 
As a result, all “radio on a chip” circuits still have a large number of external passive 
components. Also, these circuits must use fully differential topologies in order to deal 
with the cross-coupling and isolation issues. As a result, practical CMOS based RF 
circuits ironically consume more current than discrete implementations. Therefore, 
the main advantage of GaAs MESFET RFICs is not necessarily higher performance 
than silicon RFICs or discretes, rather it is achieving the performance standards for 
challenging standards with lower current. 

Assumption #3: “radio on a chip” is Possible or Desirable. The term “radio on 
a chip” is somewhat misleading. Even the strongest proponents of this approach, rep- 
resented in other chapters of this book, readily admit that many of the important parts 
of the radio cannot be effectively integrated in silicon. These include filters, high-Q 
passive components, duplexers/switches, and most importantly power amplifiers. If 
these are not considered part of the radio, then the definition of “radio on a chip” is 
arbitrarily limited to that which can be integrated in silicon. Since even the most in- 
tegrated approaches have many dozens of external components, pushing all functions 
into a single IC becomes merely an academic exercise. 

It is telling that the only serious discussion of “radio on a chip” topologies is being 
done by university professors. One possibility for this may come from the belief that 
it is only a matter of advancing research to the point where industry can implement 
the new topologies. While this may be the case in some circumstances, “radio on a 
chip” research in fact misses a key marketing reality. The more you integrate, the 
more flexibility is sacrificed. Creating a new IC product requires huge investments in 
engineering and time. Therefore, IC designs must be useful long enough to produce 
an adequate return on investment. With the very short life cycle of mobile phones, IC 
designs must be useful for several generations. Also, it is impossible to completely 
know in advance exactly what performance and features ICs will need to have be- 
fore the mobile phone designs have been completely worked out. Therefore, flexible 
designs are crucial for making a commercially viable IC. 

Digital circuits have overcome the need for flexibility in three ways: 



1. When the function has become fully commoditized, like DRAM or Flash ROM. 
Therefore, the standards are known long in advance, de facto. 



2. When the circuits are customizable or programmable. Programmable logic arrays 
and DSP can be used for several generations of systems, since their programs can 
be changed. 
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3. When the standards are set by the semiconductor maker. Intel Corporation enjoys 
the ability to make microprocessors that determine the direction of computer devel- 
opment. All others must follow. 

None of these conditions are likely to be present in the RFIC market any time soon, 
if ever. When integration occurs, it will be driven by powerful economic and political 
forces. As of this writing, there are no compelling reasons for semiconductor makers 
to invest in a single-chip RF solution in CMOS, when a multiple -chip solution using 
IC technologies optimized for different parts of the radio provide the flexibility and 
performance that system makers desire. The goal for system makers is for optimum 
integration, rather than total integration. Even in the digital world, chip partitioning is 
the key for optimum cost and yield. For this reason, Pentium® II modules are actually 
composed of multiple chips each optimized for different functions. Likewise, with the 
widely divergent demands on IC technologies in RF systems, there is little incentive 
to push all functions into a single sub-optimized CMOS IC. 

18.6 ADVANTAGES AND DISADVANTAGES OF DIFFERENT RFIC 
TECHNOLOGIES 

Each RFIC technology has its advantages and disadvantages. Tab. 18.2 is a brief 
summary of the features of each technology. As this is a simplistic list of “pluses” 
and “minuses” rather than a comprehensive analysis, and because these technologies 
will change overtime, it is important to not read too much into the details of the list. 
Rather, the purpose of the list is to illustrate that each RFIC technology has its place in 
the range of choices available for RF systems. Tab. 18.3 shows the relative advantages 
and disadvantages of RFIC technologies for various RF blocks. 

18.7 PREDICTING THE FUTURE: WHERE ARE CONSUMER RADIOS 
GOING? 

With little doubt, the picture will remain complicated for mobile phones. As long as 
there are multiple standards, and with product life cycles so short, there will remain 
many opportunities for the various IC technologies to be used. Fig. 18.1 is a diagram of 
a typical mobile today. It schematically illustrates the fact that the RF section remains 
fragmented, dominated by discrete devices and single-function ICs. 

Fig. 18.2 shows a likely scenario for future systems. There may be opportunities for 
further integration in digital and analog sections, but the greatest change will be in the 
RF section. Since there will be continuing needs for various technologies optimized 
for the different RF blocks, multi-chip RF modules pulling together these technologies 
is a more likely than “radio on a chip” integration in CMOS. 

Likely future RF developments favor increased use of GaAs for two major reasons: 

1. Increased demand for linearity, noise and power efficiency performance as de- 
scribed above. 

2. Insulating substrate allows for integration of high quality passives (Q > 17 @ 

2 GHz, for example) and excellent pin to pin isolation. 
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Table 18.2 RFIC Technologies Considered. 



Silicon BJT and BiCMOS 

+ Established Infrastructure, Flexible Design 

— Low Q Passives Limits Integration 

— Parasitics Limits RF Integration 

Silicon MOS 
+ Lowest Die Cost 

— Can’t Get RF Performance With Low Current Draw 

— Parasitics Limits RF Integration 

GaAs MESFET 

+ Established Infrastructure, Similar Cost to RF BiCMOS 
+ RF Performance: Low NF, High PAE, High Linearity (ACP, IP3) 

+ High Q Passives Enable RF Integration 

— Need Negative Supply (or Bias Generator) for Power 

— Need External Power Switch 

GaAs HBT 

+ Single Supply and Easy to Design, Small Power Amplifiers 
+ Very Good Linearity and PAE 

— Limited Material Availability, Expensive Epi 

GaAs pHEMT 

+ Best PAE, Especially at Low Voltage 

— Difficult to Manufacture, Very Limited Availability of Starting Material 

— Cost Limits Integration 

SiGe HBT 

— No Real Products or Production Heritage 

— SiGe Epitaxy is hard to do in reality 

— Low B V cb0 , Bad for Power 

— Si bipolar nearly matches performance with far fewer process layers 

— Many more mask layers, > 30 layers 



As a result for half duplex TDMA systems, ICs merging PA plus Switch plus LNA 
in GaAs are attractive without compromising RF performance. For CDMA systems, 
which have very large dynamic power control requirements favoring GaAs, ICs merg- 
ing PA plus Driver Amp plus Up-converter in GaAs are also attractive. Since CDMA 
systems are full duplex, it is unlikely that it will be possible to merge Rx and Tx in a 
single IC of any technology, even GaAs. Therefore, receiver ICs will likely stay sepa- 
rate from transmit ICs in CDMA. Direct Conversion Rx in Si CMOS remains a strong 
possibility for less challenging standards like GSM. Regardless of process technology, 
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Table 18.3 RFIC Technologies by RF Block Function. 



Function 


BiCMOS 


CMOS 


GaAs FET 


GaAs HBT 


SiGe HBT 


LNA 


Good 


Fair 


V. Good 


Good 


Good 


Mixers 


Good 


Good 


V. Good 


V. Good 


? 


PA 


Fair 


Good 


V. Good 


Good 


Fair 


RF Switch 


Poor 


Poor 


V Good 


Poor 


Poor 


VCO 


V. Good 


Good 


Fair 


Good 


? 


Pre-Scalers 


V. Good 


Poor 


Fair 


Good 


? 


Dig. Functions 


V. Good 


Good 


Fair 


Fair 


? 
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Figure 18.1 Today’s mobile phone system. 



it appears that RF circuits will modularize into commodity chip sets by standard. Only 
then will a consensus emerge. And only then will it be clear what the optimum level 
of RF integration will be for each RF block. 

18.8 LOW-TECH GaAs MESFETS: COST-EFFECTIVE RF INTEGRATION 

The main reason why GaAs has successfully penetrated consumer radios is that ion- 
implanted MESFETs have become a main-stream IC technology. Using production 
IC process tools and processing techniques including ion implantation, along with 
low cost surface mount packaging, GaAs MESFET manufacturing can take advantage 
of the economies of scale of silicon IC manufacturing. When this is parleyed with the 
inherent RF advantages of an insulating substrate, low-loss metalization and low-dis- 
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Figure 18.2 Mobile phone system Vision. 



tortion/low-noise FETs, GaAs MESFET technology provides a natural foundation for 
optimum RF integration. 

The state of the art in production ion implanted GaAs RFIC technology for mobile 
phones is well represented by Tri Quint Semiconductor’s TQTRx process. As its name 
implies, the process is useful for receive and transmit applications. Currently, tens of 
millions of LNA, Mixer and Power Amplifier ICs built in TQTRx are being shipped 
per year. As shown in Tab. 18.4, TQTRx has Enhancement (normally off), Depletion 
(normally on) and Power FETs integrated with thin film resistors, MIM capacitors 
and three layers of gold metalization. Fig. 18.3 shows a cross sectional diagram of 
TQTRx. 

18.9 CONCLUSION 

The Role of GaAs in Consumer Radios: It is rare in any high-technology business to 
see clear directions for which component technologies will dominate future systems. 
When a very promising new application arises, technology proponents and component 
suppliers scramble to take advantage of the opportunity. Wireless systems including 
mobile phones, pagers and wireless data systems are certainly "killer applications” for 
the late 1990s. 

In the inevitable response to the opportunities in wireless, the proponents of vari- 
ous RFIC technologies have lined up to take part in the excitement. Some proponents 
of highly integrated CMOS “radio on a chip” ICs are quick to say that the domi- 
nance of CMOS in digital systems will naturally extrapolate to RF. Even if it were 
technically possible to produce these circuits, to say that they would be commercially 
viable ignores some fundamental market realities. By testing the assumptions built 
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Figure 18.3 TQTRx process cross section. 



Table 18.4 Key Parameters of TriQuint’s TQTRx Process. 



Parameter 


Units 


TQTRx 


MESFET Types 




E, D, Power 


E-FET V th 


V (E) 


+0.15 


D-FET V p 


V (D, Power) 


-0.6, -2.2 


Gate Length 


txm 


0.6 


Power Output 


mW/mm 


>300 


Ft* F max 


GHz 


20, > 70 


^min 


dB 


< 1 


Avalanche Breakdown 


V 


> 15 


Interconnect 


Total Layers 


3, Total 6.5/u.m thick Gold 


Inductor Quality 


Q Factor 


>17 at 1.9GHz 


MIM Caps 


pF/mm 2 


1200 


Mask Layers 




16 



into the reasons for integration, it is clear that multiple IC technologies will be used 
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in consumer radios for the foreseeable future. This will be especially true as more 
challenging standards come on line. 

Because of the inherent advantages of GaAs IC processes for RF circuits, GaAs 
has a very bright future in the RF blocks in consumer radios. Instead of searching for 
“one size fits all” solutions, RF systems designers search for components that provide 
optimum solutions, in both cost and performance, for each part of the system. As a 
mature IC technology, GaAs MESFET provides many opportunities for low-cost sys- 
tem insertions with the necessary RF performance and higher power efficiency than 
silicon. While virtually everyone admits that GaAs has advantages for power ampli- 
fiers, closer examination reveals that GaAs also offers significant power advantages 
throughout the RF section of mobile phones. These advantages will become enabling 
components for 3 rd generation systems based on wide-band CDMA. State-of-the-art, 
high- volume GaAs MESFET RFIC processes like TriQuinf s TQTRx will play an im- 
portant role to making these systems a reality. 
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Abstract: Different RF IC technologies are compared regarding their application, 
frequency range, and suitability for RF and baseband integration. The fully monolithic 
“phone on a chip" may be technically challenging to realize, and its commercial success 
may be challenged by progress in multi-chip integration. In the second part, examples 
of state-of-the-art RF monolithic circuits for applications in the frequency range of 2 to 
17 GHz are presented. 

19.1 SOME MYTHS ABOUT RF INTEGRATED CIRCUITS 

19.1.1 Is using a single technology an advantage? 

There is a lot of hype about the single technology which will enable a complete RF 

and baseband section of a radio transceiver to be integrated on a single chip. The 

presumed advantages of such a complete integration are stated as follows: 

1. Complete integration will deliver a performance superior to a multi-chip radio. 

2. Completely integrated radios will have a smaller size. 
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3. Complete integration will yield cheaper radio transceivers. 

4. Completely integrated radio chips will be easier to apply in large quantities. 

5. Complete integration will have a cost advantage over multi-chip solutions. 

Before testing the validity of each of these statements, let us briefly analyse the tech- 
nologies available for RF integration and their future development over the next years. 
Then, a summary of current baseband IC technologies is needed for investigating a 
possible fully integrated transceiver chip. 

19. 1.2 Present technologies for RF Integrated circuits 

Silicon bipolar. The Silicon Bipolar transistor (Si-Bip) in today’s technologies typi- 
cally has a transit frequency f T of approximately 27 GHz and a maximum frequency 
of oscillation / max on the order of 30 GHz. This technology is well suited for RF ap- 
plications up to 2.5 GHz, for some functions up to 10 GHz. It is readily available from 
different foundries as a full-custom or semi-custom (array-type) technology. Future 
evolution will be towards hetero-junction bipolar devices, e.g. using SiGe as a base 
material. This will increase f T to over 50 GHz. Standard bipolar technology will 
gradually be replaced by CMOS. 

Silicon CMOS. Today, CMOS technology is in a process of rapid reduction of the 
minimum gate length of the MOS transistor. This reduction is mainly driven by digital 
CMOS applications, e.g. high-speed microprocessors, and the ever increasing storage 
density of dynamic random access memory (DRAM) chips. In line with the shrinking 
gate length, the breakdown voltage and thus the operating voltage is reduced to below 
2.5V. 

With the 0.25 pm gate length transistors used in state-of-the-art microprocessors 
it is possible to build RF circuits with good performance up to 1.. .2 GHz on stan- 
dard substrates. The intrinsic < 0.2 pm transistor will perform well up to 10 GHz, 
as has been shown by Silicon-on-Sapphire (SOS) CMOS circuits [1]. However, the 
substrates used in standard CMOS processes are highly lossy for RF. This reduces the 
performance of passive elements or even the interconnections in RF circuits consider- 
ably. 

Future 0.18 pm and 0.13 pm CMOS technologies will yield devices with a perfor- 
mance comparable to today’s GaAs MESFET device. 

Silicon BiCMOS. BiCMOS technologies try to blend the high integration level of 
CMOS with the current drive capabilities of the bipolar device. This generally results 
in a quite complex manufacturing process using more than 20 mask levels. BiCMOS 
is therefore more expensive than other technologies, particularly for low volume ap- 
plication-specific chips. In addition, some compromises have to be made in the design 
of the CMOS and bipolar transistors to make them compatible in a single process. 

With the improved performance of plain CMOS at reduced gate lengths it is likely 
that the bipolar device of the BiCMOS process is no longer needed in many applica- 
tions. 
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Gallium arsenide MESFET. The RF performance of Metal-Semiconductor FETs 
(MESFETs) can profit from the higher electron mobility of GaAs compared to silicon. 
A typical MESFET with 0.6 pm gate length has an f T of approximately 20 GHz and an 
/max of 40... 50 GHz. The higher / max makes most RF functions feasible up to 5 GHz. 
An advantage of GaAs over silicon for RF integrated circuits is the high resistivity of 
the undoped substrate. Passive elements like inductors can be realized on chip with 
adequate performance. For RF circuits using reactive (L-C) matching, the limiting 
device figure of merit is / max rather than f T . Technologies like GaAs MESFET with 
good / max and good passive elements have therefore a performance advantage not 
directly visible from f T alone. 

GaAs and silicon germanium HBT. In a Hetero-j unction Bipolar Transistor (HBT), 
the base is made of a different semiconductor material than the emitter and collector 
regions, e.g. a GaAlAs emitter with a GaAs or InGaAs base for GaAs based transistors 
or a SiGe base for silicon based devices. Due to the lower bandgap base material the 
base can be doped higher without losing the current gain. The reduced base resistance 
boosts f T and / max towards 40 GHz and higher. GaAs based HBT ICs are already 
in volume production [2], SiGe HBT processes are just becoming commercially avail- 
able [3 1. 

High-electron-mobility transistor (FIEMT). The HEMT is a field effect transistor 
in which the doping layer is separated from the conducting channel by using a special 
layer design with different bandgap materials, such as AlGaAs in the GaAs material 
system. Thus the mobility of the electrons is higher in the channel compared to a 
standard MESFET. GaAs HEMTs can achieve an / r of above 60 GHz at 0.25 pm gate 
length, while HE MTs on Indium Phosphide reach an f T above 100 GHz at similar 
gate lengths. Together with the high / max (> 200 GHz) this allows to build circuits 
with operating frequencies of 100 GHz and more. 

Comparison of RF device technologies. Figure 19.1 compares the application fre- 
quency ranges of different semiconductor technologies available for advanced inte- 
grated RF subsystems. The broken bars in the chart indicate the foreseen future expan- 
sion of silicon based technologies towards higher frequencies. 
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Figure 19.1 Application frequency range for different RF 1C technologies. 
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Trying to avoid the common mistake of comparing a technology 3 years in the fu- 
ture with a competing one available today, one should be aware that today’s > 10 GHz 
technologies (MESFET, HBT, HEMT) will also improve over time. However, most 
high- volume commercial applications for RF ICs within the next 5 to 10 years will be 
in the frequency range up to 10 GHz. 

19. 1.3 Technology choices for baseband circuits 

In the baseband IC sector, CMOS is expected to keep its leading role. With the shrink- 
ing gate length of the MOS transistor, higher performance CMOS digital circuits can 
be designed. Thus the application areas for BiCMOS circuits are likely to diminish 
due to the higher manufacturing complexity of BiCMOS processes. 

When considering RF and baseband integration, the highest potential therefore ex- 
ists in the integration of pure CMOS RF and baseband circuits. The operating voltage 
of CMOS circuits is being reduced with shrinking gate length from 3.3 V down to 
2.5 V and 1.8 V, eventually going down to 1 V or less. 

19. 1.4 Pros and cons of complete RFand baseband Integration 

Coming back to the often cited performance advantages of full RF and baseband inte- 
gration: 

1. higher performance, 

2. smaller size, 

3. lower cost, 

4. easier application, 

and considering the diversity and high performance of the above-mentioned RF IC 
technologies, it may be difficult to achieve the first advantage very soon. Concerning 
the other goals, there are also a few major obstacles, in particular related to the diverse 
properties of RF and baseband circuits: 

1. The development tools for RF and digital baseband circuits are still very different, 
with RF design tools usually not offering as many top-down design possibilities as 
digital design tools. 

2. As a result, development times for RF circuits are often longer. 

3. Flexibility of a fully integrated solution can be much reduced over a multi-chip 
system, which is of special concern in applications that do not require millions of 
identical circuits. 

4. Testing combined RF and baseband circuits can require a very complex and expen- 
sive equipment setup, it may not be possible to have test point access at all required 
instances, and built-in-self-test for RF is still in its infancy. 

5. Processes used for RF and baseband, even when both are CMOS, may still be 
incompatible, e.g. when combining a low voltage digital part with a power amplifier 
in the 1 W class. 
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6. Integration of RF and baseband requires a technology which can do both with good 
performance, high yield and low cost. 



7. Since competing technologies like Multi-Chip Modules (MCM) are progressing as 

well, the cost of a fully integrated radio may not be competitive for some time. 

Even if many people are working on integrated RF and baseband transceivers, the 
fully integrated “radio on a chip” will need time to develop. The question is not 
if the technology allows to build it — it will soon — but rather if it is commercially 
competitive with a set of a few optimised single chips. The decision for or against 
full RF-baseband integration will also be influenced by the manufacturing volume and 
the flexibility needed in a particular product design. A device such as the proposed 
"Bluetooth” [4] could soon be integrated in a single chip, whereas a true single chip 
GSM phone is still a dream of the future — and may continue to be a dream. 

In the following section, a few state of the art RF chips in the GHz frequency range 
will be reviewed. 

19.2 EXAMPLES OF GHz TRANSCEIVER CIRCUITS 

19.2. 1 Low noise amplifiers In the 2 GHz range 

LNA on Si bipolar array. As an example of a design on a silicon bipolar array, a 
two-stage low noise amplifier (LNA) is shown in Fig. 19.2. A high-gain common emit- 
ter stage (El-RL) is followed by an emitter follower output stage (E2). DC feedback 
around the first stage transistor is used for bias stabilisation. External inductors are 
used for RF noise and gain matching. 




Figure 19.2 Schematic diagram of 1 .9 GHz LNA on silicon bipolar array. 
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On the test circuit (Fig. 19.3), the matching inductors are realized as printed induc- 
tors on the substrate. They have a higher Q-factor than inductors integrated on the 
silicon chip. A noise figure of 2.3 dB could be achieved at 1.9 GHz [5]. 




Figure 19.3 The 1 .9 GHz LNA mounted on a test board with matching inductors. 



This amplifier has a power consumption of only 5.2 mW at 3 V supply voltage. 
Together with the gain of 15 dB, this results in a high figure of merit for gain per 
DC power of 2.9 dB/mW. As can be seen from the comparison chart in Fig. 19.4, this 
amplifier is competitive with LNAs fabricated with other technologies. The chart also 
shows that it is difficult to achieve the same low noise figure on silicon that is possible 
with amplifiers on a high resistivity substrate such as GaAs. 

LNA in GaAs MESFET technology with sub-1 V supply voltage. For portable 
communications applications, the power consumption of the receiver can be min- 
imised by reducing the operating voltage to below 1.5V. This may not be possible 
for the transmitter, where high efficiency is very difficult to achieve at supply voltages 
less than 2 V. 

The schematic diagram of a low-noise amplifier operating with less than 1 V supply 
voltage is shown in Fig. 19.5 [6]. Key features are a RF load at the drain of the FET 
with almost zero DC voltage drop, and a bias stabilisation circuit working down to less 
than 1 V, as the simulation in Fig. 19.6 shows. Furthemiore, this LNA was designed for 
a 200 £2 load impedance of the following mixer stage. In Fig. 19.7 the gain is plotted, 
which was measured in a 50 system and recalculated for 200 ft load impedance. 
With 0.9 V supply voltage, the gain is still higher than 10 dB. The gain is varying less 
than 1.5 dB for supply voltages between 0.7 and 1.2 V. 

The measured noise figure is below 3 dB between 1750 and 2000 MHz (Fig. 19.8). 
The chip area is mainly used by the inductors and the bond pads, as the micro-photo- 
graph in Fig. 19.9 shows. Other performance data include a -1 dB input gain compres- 
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Figure 19.4 Comparison of 1.9 GHz LNAs (from [5]). 




Figure 1 9.5 Schematic diagram of sub-1 V LNA. 



sion point of -14.5 dBm and an input third-order intercept point of -1.5 dBm at 0.9 V 
supply voltage. 

19.2.2 5 GHz LNA with switch for antenna diversity 

There are a number of proposals which require diversity reception in wireless mobile 
terminals. Most of them use separate low noise amplifiers and a diversity switch. Re- 
cently, a MCM integration of a diversity front-end was demonstrated. Power consump- 
tion and chip size of the receiver front-end can be further minimised with a monolithic 
one-chip integration of the LNA and the diversity switch function. The block diagram 
of such a “diversity LNA” is shown in Fig. 19.10. 

The diversity LNA achieves a minimum noise figure of 3.5 dB at 4.5 GHz with a 
maximum gain of 14dB using the enhancement device of a standard E/D-MESFET 
foundry process. For comparison, a single LNA was designed on the same process 
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Figure 19.6 Supply current variation of the sub-1 V LNA versus supply voltage. 
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Figure 19.7 Measured gain versus frequency of the sub-1 V LNA versus frequency. 



showing very similar gain and noise performance [7]. Both chips use full on-chip 
matching circuits. By comparison of the chip sizes in Fig. 19.1 1 and 19.12 it becomes 
clear that the designed diversity LNA requires at least 30 % smaller chip area than a 
multi-chip solution without performance penalty. 

The ratio between the on and off state of a diversity channel is more than 20 dB 
(Fig. 19.13). The noise figure of each channel is plotted in Fig. 19.14. These measure- 
ments were made on chips mounted on a test board. Due to a longer bond wire, the 
frequency response of one channel is shifted to a slightly lower frequency. 

The diversity LNA uses a single 3.3 V supply voltage for compatibility with low- 
power CMOS circuits. Power consumption is 13 mW (4 mA from a 3.3 V supply) with 
one diversity channel switched on. This is a reduction of roughly 50% compared to a 
conventional diversity receiver using individual LNAs. 
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Figure 19.8 Measured noise figure of the sub-1 V LNA at 0.9 V/2 mA bias. 




Figure 19.9 Micro-photograph of the sub-1 V LNA (chip size 1 x 1 mm 2 ). 



19.2.3 1 7 GHz PHEMT power amplifier 

High-speed wireless local area networks (WLAN) for indoor use are becoming more 
and more important. WLAN systems with a data rate up to 2Mb/s are already stan- 
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1st stages 

Figure 19.10 Block diagram of the 5 GHz diversity LNA. 




Figure 1 9.1 1 Chip photo of the 5 GHz diversity LNA (size 1 .5 x 1 .5 mm 2 ). 



dardised in IEE E 802.11. To provide higher transmission capacity, new frequency 
bands have to be allocated. The emerging European HIPERLAN standard addresses 
frequency spectra in the 5 GHz and 17 GHz range. In the European ACTS project 
“Wireless ATM Network Demonstrator’’, a system with data rates of 20Mb/s in the 
5 GHz range has been developed [8] and investigations of the requirements for data 
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Figure 19.12 Chip photo of single LNA for comparison (size 1.5 x 0.7 mm 2 ). 
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Figure 19.13 Measured gain versus frequency of the diversity LNA channel 1 
switched ON and OFF. 



rates up to 155 Mb/s in the 17 GHz band have been made, taking into account system 
architecture and technological constraints. 

As one of the key components of a future 17 GHz wireless LAN, a monolithic 
integrated power amplifier has been designed. A two-stage on-chip matched power 
amplifier with 200 mW output power was realized on a commercial pseudo-morphic 
HEMT process (Philips D02AH). Fig. 19.15 shows a block diagram of the amplifier. 

The transistor of the driver stage has a gate width of 300 pm. The input matching, 
the gate and drain biasing network as well as the interstage matching network are inte- 
grated on the chip. In the power stage, the 1200 pm transistor and its gate biasing and 
the output matching network are integrated. The drain bias network is external due 
to the high DC current required by the transistor. The low-pass-type output matching 
network, shown in the basic schematic diagram (Fig. 19.16), allows for external bias- 
ing. No harmonic tuning was done at the output. The output stage is designed to be 
conditionally stable. Negative feedback stabilises the driver stage. 

The whole circuit was simulated with a harmonic balance simulator. The HEMTs 
are simulated with a Statz MESFET model modified with an RC network between 






256 



CIRCUITS AND SYSTEMS FOR WIRELESS COMMUNICATIONS 



HP8970B NoiM FiguM Meter 




30.00 



20.00 



Figure 19.14 Measured noise figure of the three channels of the diversity LNA. 
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Figure 19.15 Block diagram of the 17 GHz PHEMT power amplifier IC. 



drain and source. Thus, small inaccuracies in the large signal simulation results are 
expected. 

As the chip photograph in Fig. 19.17 shows, via holes to the back metal provide 
an optimum source grounding, but limit a maximum layout density by the minimum 
distance required between two adjacent via holes of 200 pm. Nevertheless, the layout 
is very compact with a chip size of 1 x 15 mm 2 . 

The measured fundamental power, the power added efficiency and the drain effi- 
ciency at 17.2 GHz agree with the harmonic balance simulation as shown in Fig. 19.18. 
The output power at -1 dB gain compression is 20dBm, the saturated output power is 
23 dBm at 3.3 V supply voltage. The measured power added efficiency has a maxi- 
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Figure 19.16 Basic schematic diagram of the 17 GHz power amplifier 1C. 




Figure 19.17 



Chip photograph of the 17 GHz power amplifier (size 1 x 1 .5 mm 2 ). 



mum of 20 % and the drain efficiency is 31.5 %. More measurement data can be found 
in [9], 

19.3 CONCLUSIONS 

In the foreseeable future, special RF IC technologies will still be required for applica- 
tions in the GHz range, particularly when high transmit power or a low noise figure is 
important. For low-cost, lower performance applications such as the proposed Blue- 
tooth, full monolithic RF and baseband integration in CMOS must be the goal. 
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Figure 19.1 8 Output power, drain efficiency and power added efficiency (PAE) of the 
1 7 GHz monolithic power amplifier. 
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20.1 INTRODUCTION TO THE FOLLOWING PAPERS 

Recently, wireless communication has turned into a very fast-growing market. GSM 2 
gives a good example with a total number of 135 million users world-wide by the 
end of 1998, of which 65 million new customers subscribed in the last year. This 
impressive growth is based on the underlying technologies. The latter are the topic of 
this part: the study of complex communication functionality and related algorithms as 
well as their realization in hardware and software. 

Digital Signal Processors (DSPs) are a key component for implementing communi- 
cation algorithms. Very often, we encounter embedded DSP cores where the DSP is 
integrated on a silicon chip together with a controller core, memories, hardware accel- 
erators, and dedicated periphery blocks. For many algorithms, the DSP represents the 
essential block in the implementation, e.g. voice codecs, channel equalizers, error cor- 
rection, encryption, interference suppression, multi-channel and multi-beam reception 
as well as additional features like voice recognition. DSP performance is often mea- 
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2 Global System for Mobile Communications; second-generation digital cellular mobile system 
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sured in terms of million instructions per second (MIPS). Since the influence of the 
considered architectures is not reflected in this MIPS figure, a benchmarking exercise 
based on well-defined algorithms is recommended to compare DSP cores. 

Wireless communication systems impose demanding requirements on the DSP, es- 
pecially for mobile terminals with inherent constraints on size, weight, and battery 
capacity. In consumer applications, overall system cost and low power consumption 
represent the main key driving factors. 

20.2 TRENDS 

The trend towards digitization will continue: functionality which was previously real- 
ized in the analog domain is more and more implemented by digital signal processing. 
This tendency follows the CMOS process road-map with continuously decreasing fea- 
ture size along the time axis. It also results in a continuous cost reduction in integrating 
digital circuitry because the number of building blocks integrated on one piece of sil- 
icon can be further increased. This leads more and more to a fully integrated solution 
which is often referred to as a “system on silicon.” 

The demand for wireless transmission of data, and with it the need for more band- 
width, is growing. Yet, bandwidth is a limited resource. Therefore, the increasing 
capacity demands for wireless voice and data transmission will result in a higher com- 
plexity and efficiency of future systems. 

These trends clearly indicate that the requirements increase with respect to MIPS 
figures, low power consumption, high-level design and programming approach. Im- 
proved support for implementing floating-point algorithms is another important issue. 
It addresses the rising numerical complexity and the short time-to-market require- 
ments. It will be interesting to see how all these trends will be realized. We can 
certainly expect a lot of innovation from future DSP architectures and applications. 

20.3 PRESENTATION OVERVIEW 

The first contribution of H. Keding et al. deals with the implementation of floating- 
point algorithms in a fixed-point architecture. 

The next two papers investigate different hardware solutions for implementing dig- 
ital signal processing algorithms. E. Lambers et al. present the "R.E.A.L. DSP,” a pro- 
grammable architecture with a flexible application-specific instruction set, which can 
be tailored towards the algorithmic needs. Advanced algorithms for implementation in 
receivers are described in B. Hailer’s contribution. They lead to a dedicated, yet con- 
figurable hardware approach. The fourth paper, authored by J. Sanchez, summarizes 
the requirements and trends of cellular mobile communication terminals. 

In the fifth contribution, G. Miet describes future implementations of speech codecs 
and their impact on the system complexity. 

The last paper of this part, authored by R. L. J. Roovers, investigates wide-band 
analog-to-digital converters, which are fundamental building blocks of multi-channel 
receivers. 
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Abstract: The complexity of DSP systems grows at an ever-increasing rate while 

the implementation of these designs must meet criteria like minimum cost and a short 
time to market. Hence there is a growing need for efficient design automation and a 
seamless design flow that allows the execution of the design steps at the highest suitable 
level of abstraction. 

Especially for the design of fixed-point algorithms and systems, tool support is very 
rare, though for most digital systems the design has to result in a fixed-point implemen- 
tation. This is due to the fact that these systems are sensitive to power consumption, 
chip size and price per device. Fixed-point realizations outperform floating-point re- 
alizations by far with regard to these criteria. On the other hand, algorithm design 
starts from a floating-point description in order to initially abstract from all fixed-point 
effects. The resulting gap between the floating-point prototype and the fixed-point im- 
plementation represents one of the major bottlenecks in today’s digital designs. 

This paper will give a survey of FRIDGE 1 , a tool suite that permits a seamless de- 
sign flow, starting from an ANSI-C floating-point algorithm which is then converted to 
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a fixed-point description in fixed-C 2 . This bit-true description of the algorithm can fi- 
nally be mapped to different implementation targets, programmable DSPs or dedicated 
hardware structures. 

21.1 INTRODUCTION 

Digital system design is characterised by ever-increasing complexity and ever-tighter 
schedules, resulting in minimum costs and short time to market. This requires a seam- 
less design flow that allows the execution of the design steps at the highest suitable 
level of abstraction. 

For most digital systems, the design has to result in a fixed-point implementation, 
either in hardware or software. This is due to the fact that digital systems have to be op- 
timised with regard to power consumption, chip size, throughput and price per-device. 
Fixed-point realizations outperform floating-point realizations by far with regard to 
these criteria. 
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Figure 21.1 Fixed-point design process. 



A typical fixed-point design flow is shown in Fig. 21.1. Algorithm design starts 
from a floating-point description that is analysed by means of simulation. This abstrac- 
tion from all implementation effects allows an evaluation of the algorithm space with- 



2 fixed-C is a Fixed-Point Language based on ANSI-C, but with additional generic fixed-point data 
types. 
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out taking into account the quantisation effects on the algorithmic behaviour. Thus, 
modeling efficiency is higher at the floating-point level, and the use of floating-point 
models offers a maximum degree of re-usability. 

The transformation to the fixed-point level is quite tedious and error-prone, since a 
fixed word length and a fixed exponent needs to be assigned to every operand man- 
ually. To provide a more quantitative picture of this time-consuming process, in 
Fig. 21.1 we exemplarily displayed the design times for the implementation steps for 
an OFDM receiver frequency synchronisation unit [1] developed at ISS. In much the 
same way as for this example, often more than 50 % of the design time is spent on 
the algorithmic transformation to the fixed-point level for complex designs once the 
floating-point model has been specified. The major reasons for this bottleneck are: 

■ There is no unique transformation from floating-point to fixed-point. 

1. Different HW and SW targets put different constraints on the fixed-point speci- 
fication. 

2. Optimisation for different design criteria, like throughput, chip size, memory 
size, or accuracy, are in general mutually exclusive goals and result in a complex 
design space as sketched in Fig. 21.2. 

3. The quantisation is generally highly dependent on the application, i.e. on the 
applied test vectors. 
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Figure 21 .2 Fixed-point design space. 



■ The manual transformation of the floating-point algorithm to the fixed-point algo- 
rithm is known to be time-consuming and error-prone. Even for a single transfor- 
mation, modeling efficiency is very low. Therefore the manual floating-to-fixed 
conversion appears to be no longer acceptable. 

■ The fixed-point simulation efficiency is low. This is due to the fact that the bit-true 
behaviour of the target system has to be emulated on the host machine. 

These inefficiencies have been the motivation for FRIDGE [2-4], an interactive 
design environment for the specification, simulation and implementation of fixed-point 
systems. 
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In this paper we describe the principles and elements of FRIDGE and ou t line 
the seamless design flow which becomes possible using this design environment. 
We also give a global picture of the FRIDGE design flow by highlighting its con- 
stituents, briefly describing the fixed-point language fixed-C and the way to annotate 
operands with fixed-point attributes. Later we show the core algorithm of FRIDGE, 
the wordlength interpolation and the back ends, i.e. mapping the internal fixed-point 
representation of FRIDGE to a synthesisable/compilable description. 

21.2 THE FRIDGE DESIGN FLOW 

As pointed out above, a manual transformation into a fixed-point representation is 
hardly feasible. Since an efficient evaluation of the complex design space requires 
multiple transformations, this is even more of a design bottleneck. 

In the FRIDGE design environment, system design starts from a floating-point al- 
gorithm in ANSI-C. The designer then may annotate single operands with fixed-point 
attributes, as illustrated in Fig. 21.3. 




Figure 21.3 Quantisation methodology with FRIDGE. 



Inserting these local annotations results in a hybrid description of the algorithm, 
i.e. some of the operands are specified bit-true, while the rest are still floating-point. 
A comparative simulation of the floating-point and the hybrid code within the same 
simulation environment shows whether the local annotations are appropriate, or if 
some annotations have to be modified. 

Once the hybrid program matches the design criteria, the remaining floating-point 
operands are transferred to fixed-point operands by interpolation, where interpolation 
denotes the process of computing the fixed-point parameters of the non-annotated 
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operands from the information that is inherent to the annotated operands and the oper- 
ations performed on them. Additionally, the interpolator has to observe a set of global 
annotations, i.e. default restrictions for the calculation of fixed-point parameters. This 
can be e.g. a default maximum wordlength that corresponds to the register length of 
the target processor. 

The interpolation supplies a fully annotated program, where each operand and oper- 
ation is specified in a bit-true way. Co-simulating this algorithm with the original float- 
ing-point code will give an accuracy evaluation — and for any alterations only some of 
the local and global annotations have to be modified, while the rest is determined and 
kept consistent by the interpolator. 

What is described above are the algorithmic level transformations as illustrated 
in Fig. 21.1 which change the behaviour or accuracy of an algorithm. The resulting 
completely bit-true algorithm in fixed-C is not yet suitable for implementation, thus 
it has to be mapped to a target, e.g. a processor’s architecture or an ASIC. This is 
an implementation level transformation, where the bit-true behaviour normally stays 
unchanged. Within the FRIDGE environment different back ends map the internal 
bit-true specification to different formats/targets, according to the purpose or goal of 
the quantisation process: 

■ ANSI-C or fast simulation code 

■ Processor specific code 

■ HDL code, e.g. behavioural VHDL 

This framework allows a seamless design flow from algorithm development to im- 
plementation and therefore tears down the well-known virtual brick wall that is often 
to be found between system level design and implementation. 

21.3 FIXED-C AND LOCAL ANNOTATIONS 

Since ANSI-C offers no efficient support for fixed-point data types [5,6], we developed 
the fixed-point language fixed-C, which is a superset of the ANSI-C language. It 
comprises different generic fixed-point data types, casting operators and interpolator 
directives. These properties enable fixed-C to be used for different purposes in the 
FRIDGE design flow: 

■ Since ANSI-C is a subset of ftxed-C, the additional fixed-point constructs can be 
used as bit-true annotations to dedicated operands of the original floating-point 
ANSI-C file, resulting in a hybrid specification. This partially bit-true code can be 
used for simulation or as input to the interpolator. 

■ The bit-tme output of the interpolator is represented in fixed-C as well. This allows 
maximum transparency of the results to the designer, since the changes to the code 
are reduced to a minimum and the effects of the designer’s directives, such as local 
annotations in the hybrid code, become directly visible. 

The language constructs to describe the bit-true behaviour are contained in a C++ 
class library which can be used in any design and simulation environment that comes 
with a C or C++ compiler, such as COSSAP [7], SPW [8] or Matlab/Simulink [9]. 

For a bit-true and implementation-independent specification of an operand, a three- 
tuple is necessary, the wordlength wl, the integer wordlength iwl, and the sign s, as 
illustrated in Fig. 21.4. 
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Figure 21 .4 Fixed-point attributes of a bit-true description. 



21.3.1 The data type Fixed 

declaration: Fixeckwl , iwl , sign> d, *e,g[8] ; 

The data type Fixed receives its format when it is declared, thus a Fixed vari- 
able behaves according to these fixed-point parameters throughout its lifetime. This 
concept is called declaration-time instantiation (DTI). Similar concepts exist in other 
fixed-point languages as well [10-12]. Pointers and arrays, as frequently used in 
ANSI-C, are supported as well. 

For every assignment to a Fixed variable, a data type check is performed: 
Example: 

Fixed<6,3,s> a,b; 

Fixed<12 , 12 , u> c; 

a = b; /* correct, both types match */ 
c = b; /* type mismatch */ 

Fixed is the data type of choice e.g. for interfaces to other functionalities or for 
look-up tables, since it behaves like a memory location of a specific length. 

21.3.2 The data type fixed 

In addition to the DTI data type concept, fixed-C provides the data type fixed which 
is specially tailored for the float-to-fixed transformation process. 

declaration: fixed a, *b, c [ 8 ] ; 

A variable is declared to be of data type fixed, but no instantiation is performed 
and no fixed-point attributes are specified. 

casting: a=fixed(wl, iwl, sign, cast, *b) ; 

The variable a receives data of type <wl, iwl, sign>, the value of *b is cast 
according to the casting mode cast. The casting mode specifies how to handle over- 
flow 3 as well as quantisation 4 . The different casting modes are shown in Tab. 21.1. 



^Overflow handling specifies the behaviour in case of a wordlength reduction at the MSB side. 
4 Quantisation handling specifies the behaviour in case of a wordlength reduction at the LSB side. 
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Table 21 .1 Different modes for cast to f ixed. 



casting modes 


overflow handling 


overflow handling 




saturation 


wrap around 


quantisation by rounding 


sr 


wr 


quantisation by truncation 


st 


wt 



Every assignment to a variable overwrites all prior instantiations, i.e. one fixed 
variable may have different context-specific bit-true attributes in the same scope. This 
concept of Assignment-Time Instantiation (ATI) is motivated by the specific design 
flow: transformation starts from a floating-point program, where the designer abstracts 
from the fixed-point problems and does not think of a variable as a finite length regis- 
ter. Fig. 21.5 highlights the properties of ATI in a quantisation process. 



floating-point fixed-point assignment time instantiation (ATI) 




Figure 21 .5 Example of assignment-time instantiation. 



For the first assignment to a, a fixed-point format <1 , -2 , u> is sufficient to repre- 
sent the constant 0. 125, while for the second assignment a format <16,16, s > would 
be necessary. With ATI, this information is accessible without code changes, while for 
DTI, two options exist: 

■ merging over the fixed -point parameters for all assignments to the variable a to re- 
ceive a super-format that matches the fixed-point requirements wherever a is used. 5 
If this super-format were the input to an implementation tool, it would force the use 
of large registers, even when this is not necessary. 

■ to use the context-specific information, a variable must be split up manually into 
several different variables, which is a time-consuming and error-prone task as well. 

In contrast, the concept of local annotations and ATI is an effective way to assign 
context-specific information without changing structures or variables when exploring 
the fixed-point design space. 



5 For the example in Fig. 21.5: 2 fractional bits from the first assignment to a, 16 integer bits from the 
second, so the merged format is 18 bits long. 
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21.3.3 Interpolator directives 

The interpolator can extract information about the fixed-point parameters not only 
from direct information about wordlength, integer wordlength and sign, 6 but also from 
indirect or user-supplied information. For example one could specify only the range 
of an operand: 

a=f ixed (minim ( -2 . 35) , maxim (4 . 74) , *b ); 

This information is not sufficient for a bit-true simulation, since there is no information 
extractable e.g. about the quantisation steps. Annotations like this are called incom- 
plete local annotation or interpolator directives. They are ignored by the simulator, 
but can provide additional information to the interpolation process. 

21.4 INTERPOLATION 

The interpolator is the core of the FRIDGE design environment. As shown in Fig. 21.3, 
it determines the fixed-point formats for all operands of an algorithm, taking as input 
a user-annotated hybrid description of the algorithm and a set of global default rules, 
the global annotation file. Hence interpolation describes the computation of the fixed- 
point parameters of the non-annotated operands from the information that is inherent 
to the annotated operands. 

21.4. 1 Maximum precision interpolation 

The interpolation concept is based on three key ideas: 

Attribute propagation: The means of using the information inherent to the hybrid 
code to determine the fixed-point attributes of all operands. 

Global annotations: The description of default rules and restrictions for attribute 
propagation. 

Designer support: The interpolator supplies various feedback or reports to assist the 
designer to debug or improve the interpolation result. 

For a better understanding, the first two points are now explained in more detail: 

Attribute propagation. Given the information of the fixed-point attributes of some 
operands, the type and the fixed-point format of other operands can be extracted from 
that. For example, if both the range and the relevant fractional wordlength are specified 
for the inputs to an operation, the same attributes can be determined for the result. 7 A 
simple example for an addition is shown in Fig. 21.6. 

Given the user-annotated ranges of variables a and b, FRIDGE automatically de- 
termines the range and the accuracy, i.e. the required number of fractional bits for the 
variable c. Note that this is a worst-case or maximum-precision interpolation (MPI), 



6 As provided by casting operators described earlier. 

7 An exception is the division, where the accuracy of the operation must be specified as well. 
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, a e [-0.25,0.75] 

/ <3,1,s> 

user-annotated 
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Os e e [-2.25,1 .875^ 
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-U d = 1.5 

1.5 <2,1,u> 



Figure 21.6 Example for interpolation of ranges/wordlengths. 



so there is no way an overflow can occur. LSBs are only discarded if the wordlength 
exceeds the maximum wordlength specified in the global annotation file. 

The fixed-point parameters iwl and sign of an operand can be determined once its 
range [min, max] is known 8 [13]: 

. I sign = s 

min < | > max[ld|mm|,ld(|majc| +2~f wl )\ + 1 

o - - n 1 sign = u 

2. mm _ 0 j iw[ > ld(max + 2->') ’ 

Without the additional knowledge of fwl one has to spend one additional bit to 
guarantee that the maximum (minimum) can be represented. 

wl is determined using the information about the fractional wordlength fwl = wl- 
iwl. It is only necessary to represent those fractional bits which actually carry any 
information. 

Global annotations. While local annotations express fixed-point information for 
single operands, global annotations describe default restrictions to the complete de- 
sign. For different targets, different global restrictions apply. For SW, the functional 
units to perform specific operations are already defined by the processor’s architec- 
ture. Consider a 16 x 16 bit multiplier writing to a 32-bit register. A global annotation 
can inform the interpolator that the wordlength of a multiplication operand may not 
exceed 16 bits, while the result may have a wordlength of up to 32 bits. 

21.4.2 Utilisation of statistical knowledge for Interpolation 

With the MPI, the wordlengths of the operands increase considerably during interpo- 
lation, while in many cases the obtained large wordlengths do not correspond to an 
adequate precision. This is due to the fact that each fixed-point input x to the algo- 
rithm represents a quantised value .r 0 , i.e. it consists of the exact value .r 0 and the 
quantisation error Ax, 



x = jco + Ajc. 



( 21 . 1 ) 



8 We assume a two’s complement representation here. 
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Taking this fact into account, one can compute a noise floor that can be used to judge 
the information content of each bit. The statistical properties of the noise are prop- 
agated by the interpolative approach described above. Assuming the error PDF is 
known for some operands, e.g. for the input, the resulting noise probability density 
function (PDF) at every point of the algorithm is calculated. In principle, the PDF of 
a linear arithmetic operation can be calculated if the input PDFs are known [14]. 9 

To cut down this model to an implementable complexity we only take a few para- 
meters to describe the quantisation error of a variable x, expressed by its mean value 
Mi, and variance <j^ x . The propagation of these parameters is governed by a set of 
rules and the mean value m Xo and variance o XQ of each signal [15]. 

The effect ofLSB reduction. The criterion for a wordlength reduction is its impact 
on the noise PDF. In other words: What is the deterioration of the ratio of signal 
power and quantisation noise (SNR) for a single operation (A SNR op ) if we skip the 
least significant bit (LSB)? If one additionally defines a global threshold ASNRgi that 
must not be exceeded when reducing the wordlength, one can decide whether the LSB 
has to be preserved or not. 

The PDF of the truncation error consists of two unit pulses at 0 and 
weighted with the set probabilities P(0) and P(l): 

p(Ax) = P(0)8(Ax)+ PiDSiAx + l-W). (21.2) 

Thus the noise power and the loss of the signal-to-noise ratio for this operation 
(A SNR op ) can be calculated: 
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A SNR op therefore is a quantitative measure for the information content of the LSB. 
According to a user-specified ASNRgi threshold, the wordlength of an operand is either 
reduced or not, i.e. only if the calculated ASNR op i& lower than A SNR g i, the LSB gets 
truncated. This is repeated recursively unless the sum of all calculated values ASNR op 
exceeds the ASNR g i threshold. 

This optional interpolation method allows quantitative assessment of the impact of 
quantisation on the precision of operations and algorithms and permits the wordlengths 
of the algorithm to be kept comparatively short [15]. With the SNR criterion, the 
result is an optimised ratio of algorithm performance and the wordlengths used in the 
algorithm. 

21.5 BACK ENDS 

As stated above, the fixed-C output of the interpolator is very well-suited for algorithm 
design or for a floating-to-fixed transformation. Nevertheless, it is not directly suited 



’For example, the PDF of a sum of two statistically independent variables is obtained by convolving 
the PDFs of these variables. 
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for implementation for HW Synthesis tools or for programmable architectures. Apart 
from the fact that there are as of yet only very few implementation tools that support 
the fixed-C language [16], there are other reasons that the fixed-C output is not directly 
suited for implementation: 

■ The fixed-C code contains all context-specific fixed-point information and retains 
the structure of the original floating-point code. For implementation purposes, most 
information is not directly applicable, but is has to be refined. Often the structure 
of the code has to be changed according to this information, according to the im- 
plementation target and to the design goals. 

■ The ATI concept is not supported by most imperative implementation languages, 
where each variable corresponds to some storage location. 

■ The weight of bit pattern, i.e. the location ofthe binary point (lbp) (see Fig. 21.4) in 
a finite register is handled automatically by the fixed-point data types of the fixed-C 
language. This is no longer the case for more implementation-related representa- 
tions, like ANSI-C or VHDL. 

Hence, a back end which maps the implementation-independent fixed-C represen- 
tation to a specific target is necessary. It has to deal with embedding the fixed-point 
operands into registers, mapping them to signals or variables, optimising the code 
structures for the implementation architecture, etc. 

Currently there are different research and development activities going on at the 
Institute for Integrated Signal Processing Systems. The main implementation targets 
are 

■ ANSI-C and fast-simulation back end, 

■ Processor-specific back end, 

■ Hardware-description-language back end. 

To describe all of those back ends would be clearly out of the scope of this paper. 
Hence we focus on the ANSI-C and fast simulation back end as examples to highlight 
some problems and solutions found in this area. 

21.5.1 ANSI-C and fast-simulation back end 

The ANSI-C output of this back end serves two purposes: the implementation on pro- 
grammable processors that come with a C compiler, and reduction of simulation time 
on a host machine. Fixed-point simulation based on a C++ class library as described 
earlier or in [8,12,17] increases the runtime by one or even two orders of magnitude 
compared to the corresponding floating-point simulation. This is due to the fact that 
the fixed-point specification has to be emulated on the host machine that supports dif- 
ferent data types from the target. 

The key idea for a fast fixed-point simulation is to take advantage of all compile- 
time 10 information to minimise the processing effort that has to be spent at simulation- 
time. Two areas for compile-time optimisation have been integrated into FRIDGE: 



,0 In this case, a floating-point algorithm is compiled into a fixed-point representation, i.e. compile-time 
means interpolation-time here. 




